Abstract. We prove a density version of the Carlson-Simpson Theorem.
Specifically, we show the following.

For every integer $k \geq 2$ and every set $A$ of words over $k$ satisfying

$$\limsup_{n \to \infty} \frac{|A \cap [k]^n|}{k^n} > 0$$

there exist a word $c$ over $k$ and a sequence $(w_n)$ of left variable words over $k$
such that the set

$$\{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \mathbb{N} \text{ and } a_0, \dots, a_n \in [k]\}$$

is contained in $A$.

While the result is infinite-dimensional its proof is based on an appropriate 
finite and quantitative version, also obtained in the paper. 
1. Introduction



1.1. Overview. Our topic is Ramsey Theory, the general area of Combinatorics 
that studies the basic pigeonhole principles of discrete structures and organizes, in 
a systematic way, the results obtained by iterating them. 

1.1.1. The coloring versions. The first pigeonhole principle relevant to our
discussion in this paper is the Hales-Jewett Theorem [20]. To state it we need to
introduce some pieces of notation and some terminology. For every integer $k \geq 2$
let $[k]^{<\mathbb{N}}$ be the set of all finite sequences having values in $[k] := \{1, \dots, k\}$. The
elements of $[k]^{<\mathbb{N}}$ are referred to as words over $k$, or simply words if $k$ is understood.
If $n \in \mathbb{N}$, then $[k]^n$ stands for the set of words of length $n$. We fix a letter $v$ that
we regard as a variable. A variable word over $k$ is a finite sequence having values
in $[k] \cup \{v\}$ where the letter $v$ appears at least once. If $w$ is a variable word and
$a \in [k]$, then $w(a)$ is the word obtained by substituting all appearances of the letter
$v$ in $w$ by $a$. A combinatorial line of $[k]^n$ is a set of the form $\{w(a) : a \in [k]\}$ where
$w$ is a variable word over $k$ of length $n$.
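To make the substitution $w \mapsto w(a)$ concrete, here is a minimal Python sketch. It is an illustration only: the encoding of words as tuples, the marker `"v"` for the variable, and the function names `substitute` and `combinatorial_line` are assumptions made for this example and do not come from the paper.

```python
# Sketch only: words are tuples over {1, ..., k}; "v" is an assumed marker
# for the variable letter (any symbol outside [k] would do).
V = "v"

def substitute(w, a):
    """Return the word w(a): every occurrence of the variable is replaced by a."""
    return tuple(a if letter == V else letter for letter in w)

def combinatorial_line(w, k):
    """Return the combinatorial line {w(a) : a in [k]} as a list of words."""
    if V not in w:
        raise ValueError("a variable word must contain the variable at least once")
    return [substitute(w, a) for a in range(1, k + 1)]

# The variable word (1, v, 3, v) over k = 3 has length 4.
line = combinatorial_line((1, V, 3, V), k=3)
```

Here `line` lists the three words of the combinatorial line of $[3]^4$ determined by the variable word $(1, v, 3, v)$.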

Hales-Jewett Theorem. For every $k, r \in \mathbb{N}$ with $k \geq 2$ and $r \geq 1$ there exists an
integer $N$ with the following property. If $n \geq N$, then for every $r$-coloring of $[k]^n$
there exists a combinatorial line of $[k]^n$ which is monochromatic. The least integer
$N$ with this property will be denoted by $\mathrm{HJ}(k, r)$.

The Hales-Jewett Theorem is the bread and butter of Ramsey Theory and is
often regarded as an abstract version of the van der Waerden Theorem [38]. The
exact asymptotics of the numbers $\mathrm{HJ}(k, r)$ are still unknown. The best known
upper bounds are primitive recursive and are due to S. Shelah [33].

The second pigeonhole principle relevant to our discussion is the Halpern-Läuchli
Theorem [21], a rather deep result that concerns partitions of finite products of
infinite trees.

Halpern-Läuchli Theorem. For every finite tuple $(T_1, \dots, T_d)$ of uniquely rooted
and finitely branching trees without maximal nodes and every finite coloring of the
level product

(1.1)  $$\bigcup_{n \in \mathbb{N}} T_1(n) \times \dots \times T_d(n)$$

of $(T_1, \dots, T_d)$ there exist strong subtrees $(S_1, \dots, S_d)$ of $(T_1, \dots, T_d)$ having common
level set such that the level product of $(S_1, \dots, S_d)$ is monochromatic.

We recall that a subtree $S$ of a tree $(T, <)$ is said to be strong if: (a) $S$ is
uniquely rooted, (b) there exists an infinite subset $L_T(S) = \{l_0 < l_1 < \dots\}$ of $\mathbb{N}$,
called the level set of $S$, such that for every $n \in \mathbb{N}$ the $n$-level $S(n)$ of $S$ is a subset
of $T(l_n)$, and (c) for every $s \in S$ and every immediate successor $t$ of $s$ in $T$ there
exists a unique immediate successor $s'$ of $s$ in $S$ with $t \leq s'$. The notion of a
strong subtree was highlighted with the work of K. Milliken [25, 26] who used the
Halpern-Läuchli Theorem to show that the family of strong subtrees of a uniquely
rooted and finitely branching tree is partition regular.

The Hales-Jewett Theorem and the Halpern-Läuchli Theorem are pigeonhole
principles of quite different nature. Nevertheless, they do admit a common extension
which is due to T. J. Carlson and S. G. Simpson [6]. To state it we recall that
a left variable word over $k$ is a variable word over $k$ whose leftmost letter is the
variable $v$. The concatenation of two words $x$ and $y$ over $k$ is denoted by $x^\frown y$.

Carlson-Simpson Theorem. For every integer $k \geq 2$ and every finite coloring
of the set of all words over $k$ there exist a word $c$ over $k$ and a sequence $(w_n)$ of
left variable words over $k$ such that the set

(1.2)  $$\{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \mathbb{N} \text{ and } a_0, \dots, a_n \in [k]\}$$

is monochromatic.

The Carlson-Simpson Theorem belongs to the circle of results that provide information
on the structure of the wildcard set$^1$ of the variable word obtained by
the Hales-Jewett Theorem; see, e.g., [3J [221 12U [Ml HO]- This extra information 
(namely, that the sequence $(w_n)$ consists of left variable words) can be used to
derive the Halpern-Läuchli Theorem when the trees $T_1, \dots, T_d$ are homogeneous$^2$,
a special case which is sufficient for all known combinatorial applications of the
Halpern-Läuchli Theorem (see [29]).

1.1.2. The density versions. It is a remarkably fruitful phenomenon that many
pigeonhole principles have a density version. These density versions are strengthenings
of their coloristic counterparts and assert that every large subset of a "structure"
must contain a "substructure". In fact, the first pigeonhole principle we discussed
so far, namely the Hales-Jewett Theorem, admits a density version which
is due to H. Furstenberg and Y. Katznelson [To].

Density Hales-Jewett Theorem. For every integer $k \geq 2$ and every $0 < \delta \leq 1$
there exists an integer $N$ with the following property. If $n \geq N$, then every subset
$A$ of $[k]^n$ with $|A| \geq \delta k^n$ contains a combinatorial line of $[k]^n$. The least integer $N$
with this property will be denoted by $\mathrm{DHJ}(k, \delta)$.

The density Hales-Jewett Theorem is a fundamental result of Ramsey Theory.
It has several strong results as consequences, most notably the famous Szemerédi
Theorem on arithmetic progressions [3S] and its multidimensional version [T3]. The
best known upper bounds for the numbers $\mathrm{DHJ}(k, \delta)$ are obtained in [28] and have
an Ackermann-type dependence with respect to $k$.

$^1$ We recall that if $w = (w_i)_{i=0}^{n-1}$ is a variable word over $k$ of length $n$, then its wildcard set is
defined to be the set $\{i \in \{0, \dots, n-1\} : w_i = v\}$.

$^2$ A tree $T$ is homogeneous if it is uniquely rooted and there exists an integer $b \geq 2$ such that
every $t \in T$ has exactly $b$ immediate successors; e.g., every dyadic or triadic tree is homogeneous.
It turns out that the Halpern-Läuchli Theorem also has a density version that
was obtained relatively recently in [8].

Density Halpern-Läuchli Theorem. For every finite tuple $(T_1, \dots, T_d)$ of homogeneous
trees and every subset $A$ of the level product of $(T_1, \dots, T_d)$ satisfying

(1.3)  $$\limsup_{n \to \infty} \frac{|A \cap (T_1(n) \times \dots \times T_d(n))|}{|T_1(n) \times \dots \times T_d(n)|} > 0$$

there exist strong subtrees $(S_1, \dots, S_d)$ of $(T_1, \dots, T_d)$ having common level set such
that the level product of $(S_1, \dots, S_d)$ is a subset of $A$.

We should point out that the assumption in the above result that the trees
$T_1, \dots, T_d$ are homogeneous is not redundant. On the contrary, various examples
given in [3] show that it is essentially optimal.

1.2. The main results. In view of the above it is natural to ask whether the
Carlson-Simpson Theorem has a density analogue which would extend, among
others, both the density Hales-Jewett Theorem and the density Halpern-Läuchli
Theorem. Our goal in this paper is to answer this question affirmatively. Specifically,
we show the following theorem.

Theorem A. For every integer $k \geq 2$ and every set $A$ of words over $k$ satisfying

(1.4)  $$\limsup_{n \to \infty} \frac{|A \cap [k]^n|}{k^n} > 0$$

there exist a word $c$ over $k$ and a sequence $(w_n)$ of left variable words over $k$ such
that the set

(1.5)  $$\{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \mathbb{N} \text{ and } a_0, \dots, a_n \in [k]\}$$

is contained in $A$.

The proof of Theorem A follows a strategy that was already applied in a closely 
related context and was described in some detail in [TIB §7]. It consists of reducing 
Theorem A to an appropriate finite version. This finite version, which represents 
the combinatorial core of Theorem A, is the content of the following theorem which 
is the second main result of the paper. 

Theorem B. For every integer $k \geq 2$, every integer $m \geq 1$ and every $0 < \delta \leq 1$
there exists an integer $N$ with the following property. If $L$ is a finite subset of $\mathbb{N}$ of
cardinality at least $N$ and $A$ is a set of words over $k$ satisfying $|A \cap [k]^n| \geq \delta k^n$ for
every $n \in L$, then there exist a word $c$ over $k$ and a finite sequence $(w_n)_{n=0}^{m-1}$ of left
variable words over $k$ such that the set

(1.6)  $$\{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \{0, \dots, m-1\} \text{ and } a_0, \dots, a_n \in [k]\}$$

is contained in $A$. The least integer $N$ with this property will be denoted by
$\mathrm{DCS}(k, m, \delta)$.
The main point in Theorem B is that the result is independent of the position
of the finite set $L$. Its proof is based on a density increment strategy (a powerful
method pioneered by K. F. Roth [31]) and yields explicit upper bounds for the
numbers $\mathrm{DCS}(k, m, \delta)$. These upper bounds are admittedly rather weak. They are
in line, however, with several other bounds obtained recently in the area; see, e.g., 

da eei M- 

Although Theorem B refers to left variable words, it can be used to obtain
variable words with quite divergent structure. Specifically, given two sequences
$(p_n)$ and $(w_n)$ of variable words over $k$, we say that the sequence $(w_n)$ is of pattern
$(p_n)$ if $p_n$ is an initial segment of $w_n$ for every $n \in \mathbb{N}$. So, for instance, if $q_n = (v)$
for every $n \in \mathbb{N}$, then a sequence $(w_n)$ of variable words over $k$ is of pattern $(q_n)$ if
and only if it consists of left variable words. We show the following theorem.

Theorem C. Let $k \in \mathbb{N}$ with $k \geq 2$ and $(p_n)$ be an arbitrary sequence of variable
words over $k$. Then for every set $A$ of words over $k$ satisfying

(1.7)  $$\limsup_{n \to \infty} \frac{|A \cap [k]^n|}{k^n} > 0$$

there exist a word $c$ over $k$ and a sequence $(w_n)$ of variable words over $k$ of pattern
$(p_n)$ such that the set

(1.8)  $$\{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \mathbb{N} \text{ and } a_0, \dots, a_n \in [k]\}$$

is contained in $A$.

Of course, there is also a finite version of Theorem C in the spirit of Theorem
B. This is the content of Theorem 11.1 in the main text.

1.3. Structure of the paper. The paper is organized as follows. In §2 we set 
up our notation and terminology, and we recall some tools needed for the proof of 
the main results. Of particular importance is the notion of a Carlson-Simpson tree 
introduced in §2.5. It is the analogue, within the context of left variable words, of 
the notion of a combinatorial subspace. 

The next four sections contain several preparatory results needed for the proof
of Theorem B. This material is not only independent of the rest of the paper but
also of independent interest. In §3 we state and prove a "regularity lemma" for
subsets of $[k]^{<\mathbb{N}}$. The lemma asserts that every dense subset of $[k]^{<\mathbb{N}}$ is inherently
pseudorandom and is proved via an energy increment strategy, an influential method
introduced by E. Szemerédi [37]. In the next section, §4, we present a partition
result for Carlson-Simpson trees which is, essentially, a variant of the classical
Graham-Rothschild Theorem [15]. Finally, in §5 and §6 we develop a method of
"gluing" a pair $x$ and $y$ of words over $k$. The method can be thought of as a natural
extension of the familiar practice of concatenating $x$ and $y$. It is encoded by what
we call a convolution operation which is introduced and studied in §5. Iterations of
convolution operations are studied in §6. We emphasize that the results in §6 are
invoked only in §9. However, the material in §3, §4 and §5 is heavily used and the
reader is advised to gain some familiarity with the contents of these sections before
reading the rest of the paper.

The next four sections are devoted to the proof of Theorem B. The results in 
§7 are independent of the rest of the argument. In particular, this section can be 
read separately. The main part of the proof is contained in §8 and §9. The reader 
will find a detailed outline and an exposition of the key ideas in §8.1 and §9.1. The 
proof of Theorem B is completed in §10. 

Finally, the last section of the paper contains a discussion on some consequences 
of Theorem B, including the proofs of Theorem A and Theorem C. 

2. Background material 

By $\mathbb{N} = \{0, 1, 2, \dots\}$ we shall denote the natural numbers. For every integer $n \geq 1$
we set $[n] := \{1, \dots, n\}$. If $X$ is a nonempty finite set, then by $\mathbb{E}_{x \in X}$ we shall denote
the average $\frac{1}{|X|} \sum_{x \in X}$ where, as usual, $|X|$ stands for the cardinality of $X$. For
every function $f : \mathbb{N} \to \mathbb{N}$ and every $\ell \in \mathbb{N}$ by $f^{(\ell)} : \mathbb{N} \to \mathbb{N}$ we shall denote the $\ell$-th
iteration of $f$ defined recursively by the rule $f^{(0)}(n) = n$ and $f^{(\ell+1)}(n) = f(f^{(\ell)}(n))$
for every $n \in \mathbb{N}$.

Let $X$ be a nonempty (possibly infinite) set and $A$ be a subset of $X$. For every
nonempty finite subset $Y$ of $X$ the density of $A$ in $Y$ is defined by

(2.1)  $$\mathrm{dens}_Y(A) = \frac{|A \cap Y|}{|Y|}.$$

If it is clear from the context to which set $Y$ we are referring (for instance, if
$Y$ coincides with $X$), then we shall drop the subscript $Y$ and we shall denote the
above quantity simply by $\mathrm{dens}(A)$.

2.1. Words. For every $k \in \mathbb{N}$ with $k \geq 2$ and every $n \in \mathbb{N}$ let $[k]^n$ be the set of all
sequences of length $n$ having values in $[k]$. Precisely, $[k]^0$ contains just the empty
sequence while if $n \geq 1$, then

(2.2)  $$[k]^n = \{(x_0, \dots, x_{n-1}) : x_i \in [k] \text{ for every } i \in \{0, \dots, n-1\}\}.$$

Also let

(2.3)  $$[k]^{<n} = \bigcup_{\{i \in \mathbb{N} : i < n\}} [k]^i.$$

Notice, in particular, that $[k]^{<0}$ is empty. We set

(2.4)  $$[k]^{<\mathbb{N}} = \bigcup_{n \in \mathbb{N}} [k]^n.$$

The elements of $[k]^{<\mathbb{N}}$ are called words over $k$, or simply words if $k$ is understood.
The length of a word $x$ over $k$, denoted by $|x|$, is defined to be the unique natural
number $n$ such that $x \in [k]^n$. For every $i \in \mathbb{N}$ with $i < |x|$ by $x|i$ we shall denote
the word of length $i$ which is an initial segment of $x$. The concatenation of two
words $x, y$ will be denoted by $x^\frown y$.

2.2. Located words. For every $k \in \mathbb{N}$ with $k \geq 2$ and every (possibly empty)
finite subset $J$ of $\mathbb{N}$ by $[k]^J$ we shall denote the set of all functions from $J$ into $[k]$.
An element of the set

(2.5)  $$\bigcup_{J \subseteq \mathbb{N} \text{ finite}} [k]^J$$

will be called a located word over $k$. If $x \in [k]^J$ is a located word over $k$ and $S$ is
a subset of $J$, then $x|S$ stands for the restriction of $x$ on $S$; notice that $x|S \in [k]^S$.
Moreover, for every $x \in [k]^I$ and every $y \in [k]^J$, where $I$ and $J$ are two finite
subsets of $\mathbb{N}$ with $I \cap J = \emptyset$, by $(x, y)$ we shall denote the unique element $z$ of
$[k]^{I \cup J}$ satisfying $z|I = x$ and $z|J = y$.

Of course, every word over $k$ is a located word over $k$. Indeed, notice that

(2.6)  $$[k]^{\{m \in \mathbb{N} : m < n\}} = [k]^n$$

for every $n \in \mathbb{N}$. Conversely, we may identify located words over $k$ with words over
$k$ as follows. Let $J$ be a nonempty finite subset of $\mathbb{N}$. We set $j = |J|$ and we write
the set $J$ in increasing order as $\{n_0 < \dots < n_{j-1}\}$. The canonical isomorphism
associated to $J$ is the bijection $\mathrm{I}_J : [k]^j \to [k]^J$ defined by the rule

(2.7)  $$\mathrm{I}_J(x)(n_i) = x(i)$$

for every $i \in \{0, \dots, j-1\}$. Observing that $[k]^\emptyset = [k]^0 = \{\emptyset\}$, we define the
canonical isomorphism $\mathrm{I}_\emptyset$ associated to the empty set to be the identity.
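As an informal illustration of rule (2.7), the following Python sketch models a located word as a dictionary from positions to letters. The dictionary encoding and the names `canonical_isomorphism` and `glue` are assumptions made for this example only.

```python
def canonical_isomorphism(J):
    """Return I_J : [k]^j -> [k]^J for a finite set J = {n_0 < ... < n_{j-1}}."""
    positions = sorted(J)

    def I_J(x):
        if len(x) != len(positions):
            raise ValueError("x must have length |J|")
        # I_J(x)(n_i) = x(i), as in rule (2.7)
        return {n_i: x[i] for i, n_i in enumerate(positions)}

    return I_J

def glue(x, y):
    """Return the located word (x, y) for located words with disjoint domains."""
    if set(x) & set(y):
        raise ValueError("the domains must be disjoint")
    return {**x, **y}

# The word (1, 3, 2) placed on the positions J = {2, 5, 9}.
located = canonical_isomorphism({5, 2, 9})((1, 3, 2))
# Gluing with a located word supported on the disjoint set {0}.
merged = glue(located, canonical_isomorphism({0})((2,)))
```

Restricting `merged` to the keys `{2, 5, 9}` gives back `located`, which mirrors the defining property $z|I = x$ and $z|J = y$ of the pair $(x, y)$.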

2.3. Variable words. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$, and fix a tuple
$v_0, \dots, v_{m-1}$ of distinct letters. An $m$-variable word over $k$ is a finite sequence
having values in $[k] \cup \{v_0, \dots, v_{m-1}\}$ such that: (a) for every $i \in \{0, \dots, m-1\}$ the
letter $v_i$ appears at least once, and (b) if $m \geq 2$, then for every $i, j \in \{0, \dots, m-1\}$
with $i < j$ all occurrences of $v_i$ precede all occurrences of $v_j$. For every $m$-variable
word $w$ over $k$ and every $a_0, \dots, a_{m-1} \in [k]$ by $w(a_0, \dots, a_{m-1})$ we shall denote the
unique word over $k$ obtained by substituting in $w$ all appearances of the letter $v_i$
with $a_i$ for every $i \in \{0, \dots, m-1\}$. A left variable word over $k$ is a 1-variable
word over $k$ whose leftmost letter is the variable $v$.

2.4. Combinatorial subspaces. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$. An
$m$-dimensional combinatorial subspace of $[k]^{<\mathbb{N}}$ is a set of the form

(2.8)  $$V = \{w(a_0, \dots, a_{m-1}) : a_0, \dots, a_{m-1} \in [k]\}$$

where $w$ is an $m$-variable word over $k$. The 1-dimensional combinatorial subspaces
are called combinatorial lines.

For every $m$-dimensional combinatorial subspace $V$ of $[k]^{<\mathbb{N}}$ and every $\ell \in [m]$ let
$\mathrm{Subs}_\ell(V)$ be the set of all $\ell$-dimensional combinatorial subspaces of $[k]^{<\mathbb{N}}$ which are
contained in $V$. We will need the following special case of the Graham-Rothschild
Theorem [19].

Theorem 2.1. For every integer $k \geq 2$, every pair of integers $d \geq m \geq 1$ and every
integer $r \geq 1$ there exists an integer $N$ with the following property. If $n \geq N$ and $V$
is an $n$-dimensional combinatorial subspace of $[k]^{<\mathbb{N}}$, then for every $r$-coloring of the
set $\mathrm{Subs}_m(V)$ there exists $W \in \mathrm{Subs}_d(V)$ such that the set $\mathrm{Subs}_m(W)$ is monochromatic.
The least integer $N$ with this property will be denoted by $\mathrm{GR}(k, d, m, r)$.

Detailed expositions as well as infinite extensions of Theorem 2.1 can be found
in various places in the literature; see, e.g., [H [5J Q31 H31 [H]. Also we remark that 
there exist primitive recursive upper bounds for the numbers $\mathrm{GR}(k, d, m, r)$ which
are due to S. Shelah [33].

2.5. Carlson-Simpson trees. We are about to introduce a family of combinatorial
objects which will be of particular importance throughout the paper.

Definition 2.2. Let $k \in \mathbb{N}$ with $k \geq 2$. A Carlson-Simpson tree of $[k]^{<\mathbb{N}}$ is a set
of the form

(2.9)  $$W = \{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \{0, \dots, m-1\} \text{ and } a_0, \dots, a_n \in [k]\}$$

where $c$ is a word over $k$ and $(w_n)_{n=0}^{m-1}$ is a nonempty finite sequence of left variable
words over $k$.

It is easy to see that the sequence $(c, w_0, \dots, w_{m-1})$ that generates a Carlson-Simpson
tree $W$ via formula (2.9) is unique. This unique sequence will be called
the generating sequence of $W$. The corresponding natural number $m$ will be called
the dimension of $W$ and will be denoted by $\dim(W)$. The 1-dimensional Carlson-Simpson
trees will be called Carlson-Simpson lines.

Let $W$ be an $m$-dimensional Carlson-Simpson tree of $[k]^{<\mathbb{N}}$ and $(c, w_0, \dots, w_{m-1})$
be its generating sequence. The 0-level $W(0)$ of $W$ is defined by

(2.10)  $$W(0) = \{c\}.$$

Observe that $W(0)$ is contained in $[k]^{\ell_0}$ where $\ell_0$ is the length of $c$. Moreover, for
every $n \in [m]$ the $n$-level $W(n)$ of $W$ is defined by

(2.11)  $$W(n) = \{c^\frown w_0(a_0)^\frown \dots ^\frown w_{n-1}(a_{n-1}) : a_0, \dots, a_{n-1} \in [k]\}.$$

Notice that $W(n)$ is an $n$-dimensional combinatorial subspace of $[k]^{<\mathbb{N}}$ and is contained
in $[k]^{\ell_n}$ where $\ell_n$ is the sum of the lengths of $c, w_0, \dots, w_{n-1}$. The set
$\{\ell_0 < \dots < \ell_m\}$ will be called the level set of $W$ and will be denoted by $L(W)$.
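The levels $W(0), \dots, W(m)$ and the level set $L(W)$ can be computed directly from a generating sequence. The Python sketch below does this for a small example; the tuple encoding of words, the marker `"v"` for the variable, and the function names are assumptions made for illustration and are not part of the paper.

```python
from itertools import product

V = "v"  # assumed marker for the variable letter

def substitute(w, a):
    """Return w(a) for a variable word w and a letter a."""
    return tuple(a if letter == V else letter for letter in w)

def level(c, ws, n, k):
    """Return W(n): all words c^w_0(a_0)^...^w_{n-1}(a_{n-1}) with a_i in [k]."""
    words = []
    for letters in product(range(1, k + 1), repeat=n):
        word = tuple(c)
        for w, a in zip(ws, letters):
            word += substitute(w, a)
        words.append(word)
    return words

def carlson_simpson_levels(c, ws, k):
    """Return [W(0), ..., W(m)] for the generating sequence (c, w_0, ..., w_{m-1})."""
    for w in ws:
        if not w or w[0] != V:
            raise ValueError("each w_n must be a left variable word")
    return [level(c, ws, n, k) for n in range(len(ws) + 1)]

# A 2-dimensional example over k = 2 with c = (2), w_0 = (v, 1), w_1 = (v, v).
levels = carlson_simpson_levels(c=(2,), ws=[(V, 1), (V, V)], k=2)
level_set = [len(lv[0]) for lv in levels]
```

In this example the $n$-level has $2^n$ elements and `level_set` records the lengths $\ell_0 < \ell_1 < \ell_2$.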

For every $m$-dimensional Carlson-Simpson tree $W$ of $[k]^{<\mathbb{N}}$ and every $\ell \in [m]$
by $\mathrm{Subtr}_\ell(W)$ we shall denote the set of all $\ell$-dimensional Carlson-Simpson trees
of $[k]^{<\mathbb{N}}$ which are contained in $W$. An element of $\mathrm{Subtr}_\ell(W)$ will be called an
$\ell$-dimensional Carlson-Simpson subtree of $W$, or simply a Carlson-Simpson subtree
of $W$ if the dimension $\ell$ is understood.
The archetypical example of a Carlson-Simpson tree of $[k]^{<\mathbb{N}}$ of dimension $m$ is
the set $[k]^{<m+1}$. In fact, every Carlson-Simpson tree of dimension $m$ can be thought
of as a "copy" of $[k]^{<m+1}$ inside $[k]^{<\mathbb{N}}$. Specifically, let $W$ be an $m$-dimensional
Carlson-Simpson tree of $[k]^{<\mathbb{N}}$ and $(c, w_0, \dots, w_{m-1})$ be its generating sequence. The
canonical isomorphism associated to $W$ is the bijection $\mathrm{I}_W : [k]^{<m+1} \to W$ defined
by $\mathrm{I}_W(\emptyset) = c$ and

(2.12)  $$\mathrm{I}_W((a_0, \dots, a_{n-1})) = c^\frown w_0(a_0)^\frown \dots ^\frown w_{n-1}(a_{n-1})$$

for every $n \in [m]$ and every $(a_0, \dots, a_{n-1}) \in [k]^n$. The canonical isomorphism $\mathrm{I}_W$
preserves all structural properties one is interested in while working in the category
of Carlson-Simpson trees. For instance, if $\ell \in [m]$ and $V$ is a Carlson-Simpson
subtree of $[k]^{<m+1}$ of dimension $\ell$, then its image $\mathrm{I}_W(V)$ under the canonical isomorphism
is an $\ell$-dimensional Carlson-Simpson subtree of $W$. Thus, for most practical
purposes, we may identify $W$ with $[k]^{<m+1}$ via the canonical isomorphism $\mathrm{I}_W$.

More generally, let $W$ and $U$ be two Carlson-Simpson trees of $[k]^{<\mathbb{N}}$ of the same
dimension. The canonical isomorphism associated to the pair $W, U$ is the bijection
$\mathrm{I}_{W,U} : W \to U$ defined by the rule

(2.13)  $$\mathrm{I}_{W,U}(t) = (\mathrm{I}_U \circ \mathrm{I}_W^{-1})(t)$$

where $\mathrm{I}_W$ and $\mathrm{I}_U$ are the canonical isomorphisms associated to $W$ and $U$. Of course,
the map $\mathrm{I}_{W,U}$ will be used to transfer information from $W$ to $U$ and vice versa.

Finally, for every $m$-dimensional Carlson-Simpson tree $W$ of $[k]^{<\mathbb{N}}$ and every
$k' \in \{2, \dots, k\}$ we define the $k'$-restriction $W \upharpoonright k'$ of $W$ to be the set

(2.14)  $$\{c\} \cup \{c^\frown w_0(a'_0)^\frown \dots ^\frown w_n(a'_n) : n \in \{0, \dots, m-1\} \text{ and } a'_0, \dots, a'_n \in [k']\}$$

where $(c, w_0, \dots, w_{m-1})$ stands for the generating sequence of $W$. Notice that the
canonical isomorphism of $W$ maps $[k']^{<m+1}$ onto $W \upharpoonright k'$. Therefore, $W \upharpoonright k'$ can be
naturally identified as a Carlson-Simpson tree of $[k']^{<\mathbb{N}}$.

2.6. Insensitive sets. Let $k \in \mathbb{N}$ with $k \geq 2$ and $x, y$ be two words over $k$. Also
let $i, j \in [k]$ with $i \neq j$. We say that $x$ and $y$ are $(i, j)$-equivalent if: (a) $x$ and $y$
have common length, and (b) if $n$ is the common length of $x$ and $y$, then for every
$s \in [k] \setminus \{i, j\}$ and every $r \in \mathbb{N}$ with $r < n$ we have $x(r) = s$ if and only if $y(r) = s$.

If $n \in \mathbb{N}$ and $A$ is a subset of $[k]^n$, then $A$ is said to be $(i, j)$-insensitive if for
every $x \in A$ and every $y \in [k]^n$ if $x$ and $y$ are $(i, j)$-equivalent, then $y \in A$. The
notion of an $(i, j)$-insensitive set was introduced by S. Shelah [33] and highlighted in
the polymath proof [28] of the density Hales-Jewett Theorem. It can be naturally
extended to subsets of $[k]^{<\mathbb{N}}$ as follows.

Definition 2.3. Let $k \in \mathbb{N}$ with $k \geq 2$ and $i, j \in [k]$ with $i \neq j$. Also let $A$ be a
subset of $[k]^{<\mathbb{N}}$. We say that $A$ is $(i, j)$-insensitive if for every $n \in \mathbb{N}$ the set $A \cap [k]^n$
is $(i, j)$-insensitive. If $W$ is a Carlson-Simpson tree of $[k]^{<\mathbb{N}}$, then we say that $A$ is
$(i, j)$-insensitive in $W$ if $\mathrm{I}_W^{-1}(A \cap W)$ is an $(i, j)$-insensitive subset of $[k]^{<\mathbb{N}}$ where
$\mathrm{I}_W$ is the canonical isomorphism associated to $W$.

It is easy to see that the family of all $(i, j)$-insensitive subsets of $[k]^{<\mathbb{N}}$ is closed
under intersections, unions and complements. The same remark, of course, applies
to the family of all $(i, j)$-insensitive sets in a Carlson-Simpson tree $W$ of $[k]^{<\mathbb{N}}$.
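The notions of $(i, j)$-equivalence and $(i, j)$-insensitivity for subsets of a single level $[k]^n$ can be checked by brute force. The Python sketch below is an illustration under stated assumptions: words are tuples over $\{1, \dots, k\}$ and the function names `equivalent` and `is_insensitive` are hypothetical.

```python
from itertools import product

def equivalent(x, y, i, j):
    """Decide whether the words x and y are (i, j)-equivalent."""
    if len(x) != len(y):
        return False
    pair = {i, j}
    # Letters outside {i, j} must sit at the same positions in x and y, so at
    # every position the letters agree or both belong to {i, j}.
    return all(a == b or (a in pair and b in pair) for a, b in zip(x, y))

def is_insensitive(A, n, k, i, j):
    """Decide whether a subset A of [k]^n is (i, j)-insensitive (brute force)."""
    A = set(A)
    cube = list(product(range(1, k + 1), repeat=n))
    return all(y in A for x in A for y in cube if equivalent(x, y, i, j))

# Words of length 2 over k = 3 whose first letter equals 3.
first_letter_3 = {(3, 1), (3, 2), (3, 3)}
```

Swapping the letters 1 and 2 never changes the first letter 3, so this set is $(1, 2)$-insensitive; it is not $(1, 3)$-insensitive, since $(3, 1)$ is $(1, 3)$-equivalent to $(1, 1)$.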

2.7. Furstenberg-Weiss measures. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$. The
Furstenberg-Weiss measure $d^m_{\mathrm{FW}}$ associated to $[k]^{<m+1}$ is the probability measure
on $[k]^{<\mathbb{N}}$ defined by

(2.15)  $$d^m_{\mathrm{FW}}(A) = \mathbb{E}_{n \in \{0, \dots, m\}} \mathrm{dens}_{[k]^n}(A).$$

This class of measures was introduced by H. Furstenberg and B. Weiss [TB] and has
proven to be useful in various problems in Ramsey Theory (see, e.g., [9, 27]). We
will need the following two variants.

Definition 2.4. Let $k \in \mathbb{N}$ with $k \geq 2$.

(i) For every Carlson-Simpson tree $W$ of $[k]^{<\mathbb{N}}$ the Furstenberg-Weiss measure
$d^W_{\mathrm{FW}}$ associated to $W$ is the probability measure on $[k]^{<\mathbb{N}}$ defined by

(2.16)  $$d^W_{\mathrm{FW}}(A) = \mathbb{E}_{n \in \{0, \dots, \dim(W)\}} \mathrm{dens}_{W(n)}(A).$$

(ii) For every nonempty finite subset $L$ of $\mathbb{N}$ the generalized Furstenberg-Weiss
measure $d_L$ associated to $L$ is the probability measure on $[k]^{<\mathbb{N}}$ defined by

(2.17)  $$d_L(A) = \mathbb{E}_{n \in L} \mathrm{dens}_{[k]^n}(A).$$

It is, of course, clear that if $L$ is an initial interval of $\mathbb{N}$ of cardinality $\ell \geq 2$, then
the generalized Furstenberg-Weiss measure $d_L$ associated to $L$ coincides with the
Furstenberg-Weiss measure $d^{\ell-1}_{\mathrm{FW}}$.
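For finite sets of words the generalized Furstenberg-Weiss measure can be evaluated exactly. The following Python sketch computes $d_L(A)$ as in (2.17) using exact rational arithmetic; the tuple encoding of words and the names `dens_level` and `d_L` are assumptions made for this illustration.

```python
from fractions import Fraction

def dens_level(A, n, k):
    """Return dens_{[k]^n}(A) = |A ∩ [k]^n| / k^n for a set A of words (tuples)."""
    count = sum(1 for x in A if len(x) == n and all(1 <= a <= k for a in x))
    return Fraction(count, k ** n)

def d_L(A, L, k):
    """Return the average of dens_{[k]^n}(A) over n in the finite set L."""
    levels = sorted(set(L))
    if not levels:
        raise ValueError("L must be nonempty")
    return sum(dens_level(A, n, k) for n in levels) / len(levels)

# Over k = 2: the empty word, the word (1), and the words (1,1), (1,2).
A = {(), (1,), (1, 1), (1, 2)}
value = d_L(A, {0, 1, 2}, k=2)
```

Since $L = \{0, 1, 2\}$ is an initial interval of cardinality 3, `value` is also the Furstenberg-Weiss measure $d^2_{\mathrm{FW}}(A)$: the three level densities are $1$, $1/2$ and $1/2$.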

2.8. Probabilistic preliminaries. We record, for future use, three probabilistic
facts. The first one is an immediate consequence of Markov's inequality.

Lemma 2.5. Let $(\Omega, \Sigma, \mu)$ be a probability space and $0 < \delta \leq 1$. Also let $(A_i)_{i=1}^n$
be a finite family of measurable events in $(\Omega, \Sigma, \mu)$ such that $\mu(A_i) \geq \delta$ for every
$i \in [n]$. Then, setting $L_\omega = \{i \in [n] : \omega \in A_i\}$ for every $\omega \in \Omega$, we have

(2.18)  $$\mu(\{\omega : |L_\omega| \geq (\delta/2)n\}) \geq \delta/2.$$

Proof. For every $i \in [n]$ let $\mathbf{1}_{A_i}$ be the indicator function of the event $A_i$ and set
$Z = \frac{1}{n} \sum_{i=1}^n \mathbf{1}_{A_i}$. Then $\mathbb{E}[Z] \geq \delta$ and the result follows. □
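A small sanity check of Lemma 2.5 on a finite uniform probability space may help fix the roles of $\delta$ and $L_\omega$. The Python sketch below is only an illustration: the space $\{0, \dots, 5\}$ with the uniform measure, the three events, and the function name `lemma_2_5_quantities` are assumptions chosen for the example.

```python
from fractions import Fraction

def lemma_2_5_quantities(omega_size, events):
    """On the uniform space {0, ..., omega_size - 1}, return (delta, measure) where
    delta is the least measure of an event and measure is
    mu({omega : |L_omega| >= (delta / 2) * n})."""
    n = len(events)
    delta = min(Fraction(len(A), omega_size) for A in events)
    threshold = (delta / 2) * n
    good = [w for w in range(omega_size)
            if sum(1 for A in events if w in A) >= threshold]
    return delta, Fraction(len(good), omega_size)

# Three events of measure 1/2 in a six-point uniform space.
delta, measure = lemma_2_5_quantities(6, [{0, 1, 2}, {2, 3, 4}, {4, 5, 0}])
```

Here every point lies in at least one event, so the set in (2.18) is the whole space and its measure comfortably exceeds $\delta/2 = 1/4$.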

To state the second result we recall that if $(\Omega, \Sigma, \mu)$ is a probability space and
$Y \in \Sigma$ with $\mu(Y) > 0$, then $\mu_Y$ stands for the conditional probability measure of $\mu$
relative to $Y$ defined by

(2.19)  $$\mu_Y(A) = \frac{\mu(A \cap Y)}{\mu(Y)}$$

for every $A \in \Sigma$.
Lemma 2.6. Let $(\Omega, \Sigma, \mu)$ be a probability space and $0 < \lambda, \beta, \varepsilon \leq 1$. Let $A$ and
$B$ be two measurable events in $(\Omega, \Sigma, \mu)$ with $A \subseteq B$ and such that $\mu(A) \geq \lambda \mu(B)$
and $\mu(B) \geq \beta$. Suppose that $\mathcal{Q} = (Q_i)_{i=1}^n$ is a finite family of pairwise disjoint
measurable events in $(\Omega, \Sigma, \mu)$ such that $\mu(B \setminus \cup\mathcal{Q}) \leq \varepsilon\beta/2$ and $\mu(Q_i) > 0$ for every
$i \in [n]$. Then, setting

(2.20)  $$I = \{i \in [n] : \mu_{Q_i}(A) \geq (\lambda - \varepsilon)\mu_{Q_i}(B) \text{ and } \mu_{Q_i}(B) \geq \beta\varepsilon/4\},$$

we have

(2.21)  $$\sum_{i \in I} \mu(Q_i) \geq \beta\varepsilon/4.$$

In particular, if $\mu(Q_i) = \mu(Q_j)$ for every $i, j \in [n]$, then $|I| \geq (\beta\varepsilon/4)n$.

Proof. Notice, first, that $\mu(A \setminus \cup\mathcal{Q}) \leq \varepsilon\beta/2$. This is easily seen to imply that

(2.22)  $$\sum_{i=1}^n \frac{\mu(A \cap Q_i)}{\mu(B)} \geq \lambda - \varepsilon/2.$$

For every $i \in [n]$ let $a_i = \mu_{Q_i}(A)/\mu_{Q_i}(B)$, $b_i = \mu_{Q_i}(B)$ and $c_i = \mu(Q_i)/\mu(B)$ with
the convention that $a_i = 0$ if the set $B \cap Q_i$ happens to be empty. Then inequality
(2.22) can be reformulated as

(2.23)  $$\sum_{i=1}^n a_i b_i c_i \geq \lambda - \varepsilon/2.$$

Notice that

(2.24)  $$\sum_{i=1}^n b_i c_i \leq 1 \text{ and } \sum_{i=1}^n c_i \leq \frac{1}{\beta}.$$

Also observe that $I = \{i \in [n] : a_i \geq \lambda - \varepsilon \text{ and } b_i \geq \beta\varepsilon/4\}$. Since $0 \leq a_i, b_i \leq 1$
for every $i \in [n]$, combining (2.23), (2.24) and the previous remarks, we see that
$\sum_{i \in I} c_i \geq \varepsilon/4$ and the proof is completed. □

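The final result of this subsection is the following.

Lemma 2.7. Let $0 < \theta < \varepsilon \leq 1$ and $n \in \mathbb{N}$ with $n \geq (\varepsilon^2 - \theta^2)^{-1}$. If $(A_i)_{i=1}^n$ is a
family of measurable events in a probability space $(\Omega, \Sigma, \mu)$ satisfying $\mu(A_i) \geq \varepsilon$ for
every $i \in [n]$, then there exist $i, j \in [n]$ with $i \neq j$ such that $\mu(A_i \cap A_j) \geq \theta^2$.

Proof. We set $X = \sum_{i=1}^n \mathbf{1}_{A_i}$ where $\mathbf{1}_{A_i}$ is the indicator function of the event $A_i$
for every $i \in [n]$. Then $\mathbb{E}[X] \geq \varepsilon n$ so, by convexity,

(2.25)  $$\sum_{i \in [n]} \sum_{j \in [n] \setminus \{i\}} \mu(A_i \cap A_j) = \mathbb{E}[X(X - 1)] \geq \varepsilon n(\varepsilon n - 1).$$

Therefore, there exist $i, j \in [n]$ with $i \neq j$ such that $\mu(A_i \cap A_j) \geq \theta^2$. □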
3. A regularity lemma for subsets of $[k]^{<\mathbb{N}}$

3.1. Statement of the main result. Our goal in this section is to prove a
"regularity lemma" for subsets of $[k]^{<\mathbb{N}}$. Roughly speaking, the lemma asserts that
if $n$ is large enough and $A$ is a subset of $[k]^n$, then we may find a set of coordinates
$I \subseteq \{m \in \mathbb{N} : m < n\}$ of preassigned cardinality such that the set $A$, viewed as a
subset of the product $[k]^I \times [k]^{\{m \in \mathbb{N} : m < n\} \setminus I}$, behaves like a randomly chosen set.

To put things in a proper perspective we need, first, to determine the kind of 
randomness we are referring to. This is the content of the following definition. 

Definition 3.1. Let $k \in \mathbb{N}$ with $k \geq 2$ and $\mathcal{F}$ be a family of subsets of $[k]^{<\mathbb{N}}$. Also
let $0 < \varepsilon \leq 1$ and $L$ be a nonempty finite subset of $\mathbb{N}$. The family $\mathcal{F}$ will be called
$(\varepsilon, L)$-regular provided that for every $A \in \mathcal{F}$, every $n \in L$, every (possibly empty)
subset $I$ of $\{l \in L : l < n\}$ and every $y \in [k]^I$ we have

(3.1)  $$|\mathrm{dens}(\{w \in [k]^{\{m \in \mathbb{N} : m < n\} \setminus I} : (y, w) \in A \cap [k]^n\}) - \mathrm{dens}(A \cap [k]^n)| \leq \varepsilon.$$

Notice that for every $y \in [k]^I$ the set $\{w \in [k]^{\{m \in \mathbb{N} : m < n\} \setminus I} : (y, w) \in A \cap [k]^n\}$ is
just the section of $A \cap [k]^n$ at $y$. So what Definition 3.1 guarantees is that for every
$n \in L$ and every $I \subseteq \{l \in L : l < n\}$ the densities of the sections of $A \cap [k]^n$ along
elements of $[k]^I$ are essentially equal to the density of $A \cap [k]^n$.
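For very small parameters the condition of Definition 3.1 can be tested exhaustively. The Python sketch below does so, reading the inequality in (3.1) as non-strict; the tuple encoding of words (coordinates indexed from 0), the name `is_regular`, and the two example sets are assumptions made purely for illustration.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations, product

def is_regular(family, L, k, eps):
    """Brute-force test of (eps, L)-regularity for a family of finite sets of words."""
    for A in family:
        for n in L:
            A_n = {x for x in A if len(x) == n}          # A ∩ [k]^n
            base = Fraction(len(A_n), k ** n)            # dens(A ∩ [k]^n)
            below = [l for l in sorted(L) if l < n]
            for size in range(len(below) + 1):
                for I in combinations(below, size):
                    # Number of elements of A ∩ [k]^n in the section at each y.
                    counts = Counter(tuple(x[i] for i in I) for x in A_n)
                    section_size = k ** (n - size)
                    for y in product(range(1, k + 1), repeat=size):
                        if abs(Fraction(counts[y], section_size) - base) > eps:
                            return False
    return True

# Words of length 1 and 2 over k = 2, selected by their first or last letter.
first_is_1 = {(1,), (1, 1), (1, 2)}
last_is_1 = {(1,), (1, 1), (2, 1)}
```

With $L = \{1, 2\}$ the only coordinate that may be fixed is coordinate 1 of the words of length 2. Fixing it does not change the density of `first_is_1`, whereas the sections of `last_is_1` have density 0 or 1 instead of 1/2.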

We are now ready to state the main result of this section. 

Lemma 3.2. For every $0 < \varepsilon \leq 1$ and every $k, \ell, q \in \mathbb{N}$ with $k \geq 2$ and $\ell, q \geq 1$
there exists an integer $n$ with the following property. If $N$ is a finite subset of $\mathbb{N}$
with $|N| \geq n$ and $\mathcal{F}$ is a family of subsets of $[k]^{<\mathbb{N}}$ with $|\mathcal{F}| = q$, then there exists
a subset $L$ of $N$ with $|L| = \ell$ such that $\mathcal{F}$ is $(\varepsilon, L)$-regular. The least integer $n$ with
this property will be denoted by $\mathrm{Reg}(k, \ell, q, \varepsilon)$.

The proof of Lemma 3.2 will be given in §3.2. It is based on an energy increment
strategy, a powerful method introduced by E. Szemerédi in his proof of the
celebrated regularity lemma [37]. The argument is, of course, effective and yields
explicit upper bounds for the numbers $\mathrm{Reg}(k, \ell, q, \varepsilon)$.

3.2. Proof of Lemma 3.2. We begin with the following definition which is the
most important ingredient of the proof.

Definition 3.3. Let $k, n \in \mathbb{N}$ with $k \geq 2$. Also let $I$ be a (possibly empty) subset
of $\{m \in \mathbb{N} : m < n\}$. For every subset $A$ of $[k]^n$ we define the energy of $A$ with
respect to $I$ to be the quantity

(3.2)  $$e_I(A) = \mathbb{E}_{y \in [k]^I} \mathrm{dens}(A_y)^2$$

where $A_y = \{w \in [k]^{\{m \in \mathbb{N} : m < n\} \setminus I} : (y, w) \in A\}$ is the section of $A$ at $y$.

We will isolate some basic properties of the energy which are needed for the
proof. To this end, we need to introduce some pieces of notation. Specifically, let
$k, n \in \mathbb{N}$ with $k \geq 2$ and $I, J$ be two subsets of $\{m \in \mathbb{N} : m < n\}$ with $I \cap J = \emptyset$.
We set $M = \{m \in \mathbb{N} : m < n\} \setminus (I \cup J)$. If we are given a subset $A$ of $[k]^n$, then
we may view the set $A$ as a subset of the product $[k]^I \times [k]^J \times [k]^M$ and so we may
define the section $A_{(y,z)} = \{v \in [k]^M : (y, z, v) \in A\}$ for every $(y, z) \in [k]^I \times [k]^J$.
Notice that

(3.3)  $$\mathrm{dens}(A_y) = \mathbb{E}_{z \in [k]^J} \mathrm{dens}(A_{(y,z)}) \text{ and } \mathrm{dens}(A_z) = \mathbb{E}_{y \in [k]^I} \mathrm{dens}(A_{(y,z)})$$

for every $y \in [k]^I$ and every $z \in [k]^J$. We have the following.

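Fact 3.4. Let $k, n \in \mathbb{N}$ with $k \geq 2$. Also let $I$ be a subset of $\{m \in \mathbb{N} : m < n\}$.
Then for every subset $A$ of $[k]^n$ we have that $e_I(A) \leq 1$. Moreover, if $J$ is a subset
of $\{m \in \mathbb{N} : m < n\}$ with $I \cap J = \emptyset$, then

(3.4)  $$e_{I \cup J}(A) - e_J(A) = \mathbb{E}_{z \in [k]^J} \mathbb{E}_{y \in [k]^I} \Big(\mathrm{dens}(A_{(y,z)}) - \mathbb{E}_{y \in [k]^I} \mathrm{dens}(A_{(y,z)})\Big)^2.$$

In particular, $e_J(A) \leq e_{I \cup J}(A)$.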

Proof. The fact that $e_I(A) \leq 1$ follows immediately by Definition 3.3. Observe that

(3.5)  $$e_{I \cup J}(A) = \mathbb{E}_{z \in [k]^J} \Big(\mathbb{E}_{y \in [k]^I} \mathrm{dens}(A_{(y,z)})^2\Big)$$

and

(3.6)  $$e_J(A) \stackrel{(3.3)}{=} \mathbb{E}_{z \in [k]^J} \Big(\mathbb{E}_{y \in [k]^I} \mathrm{dens}(A_{(y,z)})\Big)^2.$$

Combining (3.5) and (3.6) the result follows. □
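The energy of Definition 3.3 and the monotonicity stated in Fact 3.4 can be observed on a toy example. The Python sketch below computes $e_I(A)$ exactly; the tuple encoding of words (coordinates indexed from 0) and the function name `energy` are assumptions made for this illustration.

```python
from fractions import Fraction

def energy(A, I, n, k):
    """Return e_I(A) = E_{y in [k]^I} dens(A_y)^2 for a set A of words of length n."""
    coords = sorted(I)
    rest = n - len(coords)          # number of free coordinates in each section
    counts = {}
    for x in A:
        y = tuple(x[i] for i in coords)
        counts[y] = counts.get(y, 0) + 1
    # Sections that do not meet A have density 0 and contribute nothing.
    total = sum(Fraction(c, k ** rest) ** 2 for c in counts.values())
    return total / k ** len(coords)

# A = {(1, 1), (1, 2)} inside [2]^2, a set of density 1/2.
A = {(1, 1), (1, 2)}
e_empty = energy(A, set(), n=2, k=2)
e_0 = energy(A, {0}, n=2, k=2)
e_1 = energy(A, {1}, n=2, k=2)
e_01 = energy(A, {0, 1}, n=2, k=2)
```

For the empty set of coordinates the energy is $\mathrm{dens}(A)^2$, and enlarging the set of coordinates never decreases the energy, in line with Fact 3.4.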
The first step towards the proof of Lemma 3.2 is the following.



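Sublemma 3.5. Let $k, n \in \mathbb{N}$ with $k \geq 2$. Also let $I$ and $J$ be two subsets of
$\{m \in \mathbb{N} : m < n\}$ with $I \cap J = \emptyset$. Finally let $A$ be a subset of $[k]^n$ and $0 < \varepsilon < k^{-|I|}$.
If $e_{I \cup J}(A) - e_J(A) \leq \varepsilon^4$, then

(3.7)  $$\mathrm{dens}(\{z \in [k]^J : |\mathrm{dens}(A_{(y,z)}) - \mathrm{dens}(A_z)| \leq \varepsilon \text{ for every } y \in [k]^I\}) \geq 1 - \varepsilon.$$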

Proof. We set $Y = [k]^I$ and $Z = [k]^J$. For every $z \in Z$ let $f_z : Y \to [0, 1]$ be the
random variable defined by $f_z(y) = \mathrm{dens}(A_{(y,z)})$. Let $\mathbb{E}(f_z) = \mathbb{E}_{y \in Y} f_z(y)$ be the
expected value of $f_z$ and $\mathrm{Var}(f_z) = \mathbb{E}(f_z^2) - \mathbb{E}(f_z)^2$ be its variance. Notice that
$\mathbb{E}(f_z) = \mathrm{dens}(A_z)$. By (3.4), we see that $\mathbb{E}_{z \in Z} \mathrm{Var}(f_z) = e_{I \cup J}(A) - e_J(A)$. Hence,
by our assumptions, we have

(3.8)  $$\mathbb{E}_{z \in Z} \mathrm{Var}(f_z) \leq \varepsilon^4$$

and so, by Markov's inequality,

(3.9)  $$\mathrm{dens}(\{z \in Z : \mathrm{Var}(f_z) \leq \varepsilon^3\}) \geq 1 - \varepsilon.$$

Fix $z_0 \in Z$ with $\mathrm{Var}(f_{z_0}) \leq \varepsilon^3$. By Chebyshev's inequality, we have

(3.10)  $$\mathrm{dens}(\{y \in Y : |f_{z_0}(y) - \mathbb{E}(f_{z_0})| \leq \varepsilon\}) \geq 1 - \varepsilon$$

and since $\varepsilon < k^{-|I|}$ we get that $|f_{z_0}(y) - \mathbb{E}(f_{z_0})| \leq \varepsilon$ for every $y \in Y$. This is
equivalent to saying that $|\mathrm{dens}(A_{(y,z_0)}) - \mathrm{dens}(A_{z_0})| \leq \varepsilon$ for every $y \in [k]^I$ and the
proof is completed. □



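Sublemma 3.5 will be used in the following form.

Corollary 3.6. Let $k, n \in \mathbb{N}$ with $k \geq 2$. Also let $I$ and $J$ be two subsets of
$\{m \in \mathbb{N} : m < n\}$ with $I \cap J = \emptyset$. Finally let $A$ be a subset of $[k]^n$ and $0 < \varepsilon < k^{-|I|}$.
If $e_{I \cup J}(A) - e_J(A) \leq \varepsilon^4/16$, then $|\mathrm{dens}(A_y) - \mathrm{dens}(A)| \leq \varepsilon$ for every $y \in [k]^I$.

Proof. We set $\varepsilon_0 = \varepsilon/2$ and

(3.11)  $$Z_0 = \{z \in [k]^J : |\mathrm{dens}(A_{(y,z)}) - \mathrm{dens}(A_z)| \leq \varepsilon_0 \text{ for every } y \in [k]^I\}.$$

By Sublemma 3.5 we have $\mathrm{dens}([k]^J \setminus Z_0) \leq \varepsilon_0$. Hence, for every $y \in [k]^I$,

(3.12)  $$\begin{aligned} |\mathrm{dens}(A_y) - \mathrm{dens}(A)| &\leq \mathbb{E}_{z \in [k]^J} |\mathrm{dens}(A_{(y,z)}) - \mathrm{dens}(A_z)| \\ &\leq \mathbb{E}_{z \in Z_0} |\mathrm{dens}(A_{(y,z)}) - \mathrm{dens}(A_z)| + \varepsilon_0 \\ &\leq \varepsilon_0 + \varepsilon_0 = \varepsilon \end{aligned}$$

as desired. □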
We proceed to the second step of the proof of Lemma 3.2.



Sublemma 3.7. Let $k, m, q \in \mathbb{N}$ with $k \geq 2$ and $q \geq 1$ and $0 < \varepsilon < k^{-m}$. Also
let $N$ be a finite subset of $\mathbb{N}$ with $|N| \geq (q\lfloor 16\varepsilon^{-4} \rfloor + 1)m + 1$ and $\mathcal{F}$ be a family of
subsets of $[k]^{\max(N)}$ with $|\mathcal{F}| = q$. Then, setting $N' = N \setminus \{\max(N)\}$, there exists a
subinterval $M$ of $N'$ (i.e., $M$ is of the form $J \cap N'$ for some interval $J$ of $\mathbb{N}$) with
$|M| = m$ and such that for every $A \in \mathcal{F}$, every subset $I$ of $M$ and every $y \in [k]^I$ we
have $|\mathrm{dens}(A_y) - \mathrm{dens}(A)| \leq \varepsilon$.

Proof. Clearly we may assume that $m \geq 1$. We set $r_0 = q\lfloor 16\varepsilon^{-4} \rfloor + 1$. Write the
first $r_0 \cdot m$ elements of $N'$ in increasing order as $\{n_0 < n_1 < \dots < n_{r_0 \cdot m - 1}\}$. For
every $p \in \{0, \dots, r_0 - 1\}$ let

(3.13) $I_p = \{n_{p \cdot m + j} : j \in \{0, \dots, m - 1\}\}$ and $J_p = \{j \in \mathbb{N} : j < n_{p \cdot m}\}.$

Notice that $\max(J_p) < \min(I_p) < \max(N)$. Moreover, $I_p \cup J_p \subseteq J_{p+1}$ if $p \leq r_0 - 2$.
Hence, by Fact 3.4 we have

(3.14) $e_{J_p}(A) \leq e_{I_p \cup J_p}(A) \leq e_{J_{p+1}}(A) \leq e_{I_{p+1} \cup J_{p+1}}(A) \leq 1$

for every $p \in \{0, \dots, r_0 - 2\}$ and every $A \in \mathcal{F}$. For every $A \in \mathcal{F}$ let

(3.15) $P_A = \{p \in \{0, \dots, r_0 - 1\} : e_{I_p \cup J_p}(A) - e_{J_p}(A) > \varepsilon^4/16\}.$

The previous discussion implies that the set $P_A$ has cardinality at most $\lfloor 16\varepsilon^{-4} \rfloor$.
Therefore, we may select $p_0 \in \{0, \dots, r_0 - 1\}$ such that $p_0 \notin P_A$ for every $A \in \mathcal{F}$; in
particular, $e_{I_{p_0} \cup J_{p_0}}(A) - e_{J_{p_0}}(A) \leq \varepsilon^4/16$. Since $I_{p_0} \cap J_{p_0} = \emptyset$, by Corollary 3.6
we conclude that

(3.16) $|\mathrm{dens}(A_y) - \mathrm{dens}(A)| \leq \varepsilon$

for every $y \in [k]^{I_{p_0}}$ and every $A \in \mathcal{F}$.

We set $M = I_{p_0}$. We will show that with this choice all requirements of the
sublemma are satisfied. Indeed, notice that $M$ is a subinterval of $N'$ with $|M| = m$.
We fix $A \in \mathcal{F}$. Also let $I \subseteq M$ and $y \in [k]^I$ be arbitrary. Observe that for every
$z \in [k]^{M \setminus I}$ we have $(y,z) \in [k]^{I_{p_0}}$. Hence, by (3.16),

(3.17) $|\mathrm{dens}(A_y) - \mathrm{dens}(A)| = |\mathbb{E}_{z \in [k]^{M \setminus I}} \mathrm{dens}(A_{(y,z)}) - \mathrm{dens}(A)|$
$\leq \mathbb{E}_{z \in [k]^{M \setminus I}} |\mathrm{dens}(A_{(y,z)}) - \mathrm{dens}(A)| \leq \varepsilon$

and the proof is completed. □
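
The selection of the index $p_0$ in the proof above is a pigeonhole count: by (3.14) the increments $e_{I_p \cup J_p}(A) - e_{J_p}(A)$ are nonnegative and sum to at most 1, so each member of the family excludes at most $\lfloor 16\varepsilon^{-4} \rfloor$ indices. The following Python sketch illustrates only this selection step. The function name `find_good_block` and the input layout (one row of precomputed increments per set in the family) are illustrative assumptions and not part of the paper.

```python
from fractions import Fraction

def find_good_block(increments, eps):
    """Return the least index p0 whose increment is <= eps^4/16 for every row.

    increments[a][p] is assumed to hold e_{I_p u J_p}(A_a) - e_{J_p}(A_a)
    for the a-th set of the family (hypothetical, precomputed input).
    Returns None if no such index exists.
    """
    threshold = Fraction(eps) ** 4 / 16
    number_of_blocks = len(increments[0])
    for p in range(number_of_blocks):
        # p is admissible when it lies outside P_A for every A in the family
        if all(row[p] <= threshold for row in increments):
            return p
    return None
```

When every row is nonnegative with sum at most 1 and there are at least $q\lfloor 16\varepsilon^{-4} \rfloor + 1$ blocks, the counting argument of the proof guarantees that the function does not return `None`.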

We are in a position to complete the proof of Lemma 3.2. To this end, we need
to introduce some numerical invariants. Specifically, for every $0 < \varepsilon \leq 1$ and every
$k, \ell, q \in \mathbb{N}$ with $k \geq 2$ and $\ell, q \geq 1$ let

(3.18) $\rho = \rho(k,\ell,q,\varepsilon) = \min\{\varepsilon, k^{-\ell}/2\}$

and define $F_{k,\ell,q,\varepsilon} : \mathbb{N} \to \mathbb{N}$ by the rule

(3.19) $F_{k,\ell,q,\varepsilon}(m) = (q\lfloor 16\rho^{-4} \rfloor + 1)m + 1.$

Proof of Lemma 3.2. We will show that

(3.20) $\mathrm{Reg}(k,\ell,q,\varepsilon) \leq F^{(\ell)}_{k,\ell,q,\varepsilon}(0)$

for every $0 < \varepsilon \leq 1$ and every $k, \ell, q \in \mathbb{N}$ with $k \geq 2$ and $\ell, q \geq 1$. Indeed, let $N$
be a finite subset of $\mathbb{N}$ with $|N| \geq F^{(\ell)}_{k,\ell,q,\varepsilon}(0)$ and fix a family $\mathcal{F}$ of subsets of $[k]^{<\mathbb{N}}$
with $|\mathcal{F}| = q$. We select a subset $M_0$ of $N$ with $|M_0| = F^{(\ell)}_{k,\ell,q,\varepsilon}(0)$. By repeated
applications of Sublemma 3.7 we may construct a family $\{M_1, \dots, M_{\ell-1}\}$ of finite
subsets of $M_0$ such that for every $i \in [\ell - 1]$

(a) $|M_i| = F^{(\ell-i)}_{k,\ell,q,\varepsilon}(0)$,

(b) $M_i$ is a subinterval of $M_{i-1} \setminus \{\max(M_{i-1})\}$ and

(c) for every $A \in \mathcal{F}$, every subset $I$ of $M_i$ and every $y \in [k]^I$ we have

(3.21) $|\mathrm{dens}(\{w \in [k]^C : (y,w) \in A \cap [k]^{\max(M_{i-1})}\}) - \mathrm{dens}(A \cap [k]^{\max(M_{i-1})})| \leq \varepsilon$

where $C = \{m \in \mathbb{N} : m < \max(M_{i-1})\} \setminus I$.

We set $L = \{\max(M_{\ell-1}) < \dots < \max(M_0)\}$. Using properties (b) and (c) it is easy
to check that the family $\mathcal{F}$ is $(\varepsilon, L)$-regular, as desired. □
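
To get a feeling for the size of the bound in (3.20), one can evaluate the iterate numerically. The sketch below assumes that (3.18) to (3.20) read $\rho = \min\{\varepsilon, k^{-\ell}/2\}$, $F(m) = (q\lfloor 16\rho^{-4} \rfloor + 1)m + 1$ and $\mathrm{Reg}(k,\ell,q,\varepsilon) \leq F^{(\ell)}(0)$, where $F^{(\ell)}$ is the $\ell$-fold iterate. The function names are illustrative and do not come from the paper; exact rational arithmetic is used so that the floor is computed without rounding error.

```python
from fractions import Fraction
from math import floor

def rho(k, ell, eps):
    """rho = min{eps, k^(-ell)/2}, as in (3.18); eps is a rational in (0, 1]."""
    return min(Fraction(eps), Fraction(1, 2 * k**ell))

def F(k, ell, q, eps, m):
    """F(m) = (q * floor(16 * rho^(-4)) + 1) * m + 1, as in (3.19)."""
    r = rho(k, ell, eps)
    return (q * floor(16 / r**4) + 1) * m + 1

def reg_upper_bound(k, ell, q, eps):
    """The ell-fold iterate of F evaluated at 0, i.e. the right-hand side of (3.20)."""
    value = 0
    for _ in range(ell):
        value = F(k, ell, q, eps, value)
    return value
```

For instance, with $k = 2$, $\ell = 2$, $q = 1$ and $\varepsilon = 1$ one has $\rho = 1/8$, so $F(m) = 65537m + 1$ and the bound equals $F(F(0)) = 65538$.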
4. A variant of the Graham-Rothschild Theorem for left variable words

Recall that for every Carlson-Simpson tree $V$ of $[k]^{<\mathbb{N}}$ and every $\ell \in [\dim(V)]$
by $\mathrm{Subtr}_\ell(V)$ we denote the set of all $\ell$-dimensional Carlson-Simpson subtrees of
$V$. This section is devoted to the proof of the following partition result.

Theorem 4.1. For every integer $k \geq 2$, every pair of integers $d \geq m \geq 1$ and
every integer $r \geq 1$ there exists an integer $N$ with the following property. If
$n \geq N$ and $W$ is an $n$-dimensional Carlson-Simpson tree of $[k]^{<\mathbb{N}}$, then for every
$r$-coloring of the set $\mathrm{Subtr}_m(W)$ there exists $U \in \mathrm{Subtr}_d(W)$ such that the set
$\mathrm{Subtr}_m(U)$ is monochromatic. The least integer $N$ with this property will be denoted
by $\mathrm{CS}(k, d, m, r)$.

Theorem 4.1 is, of course, a variant of Theorem 2.1. It can hardly be characterized
as new since it follows using fairly standard arguments. Nevertheless, we
have decided to include a proof for two reasons. The first one is self-containedness.
The second is that we want to emphasize the bounds we get from the argument for
the numbers $\mathrm{CS}(k, d, m, r)$.

We start by introducing some pieces of notation. Specifically, let $k, d, m \in \mathbb{N}$
with $k \geq 2$ and $d \geq m \geq 1$. Also let $W$ be a $d$-dimensional Carlson-Simpson tree
of $[k]^{<\mathbb{N}}$ and $V \in \mathrm{Subtr}_m(W)$. The depth of $V$ in $W$, denoted by $\mathrm{depth}_W(V)$, is
defined to be the unique integer $i \in \{m, \dots, d\}$ such that the $m$-level $V(m)$ of $V$ is
contained in the $i$-level $W(i)$ of $W$, or equivalently, $V(m) \in \mathrm{Subs}_m(W(i))$. We set

(4.1) $\mathrm{Subtr}^{\max}_m(W) = \{V \in \mathrm{Subtr}_m(W) : \mathrm{depth}_W(V) = \dim(W)\}.$

That is, $\mathrm{Subtr}^{\max}_m(W)$ is the set of all $m$-dimensional Carlson-Simpson subtrees of
$W$ of maximal depth. Part of our interest in this subclass is justified by the following
simple, though important, fact. Its proof is a rather straightforward consequence
of the relevant definitions.

Fact 4.2. For every integer $d \geq 1$, every $d$-dimensional Carlson-Simpson tree $W$
of $[k]^{<\mathbb{N}}$ and every $m \in [d]$ the map

(4.2) $\mathrm{Subtr}^{\max}_m(W) \ni V \mapsto V(m) \in \mathrm{Subs}_m(W(d))$

is a bijection.

Combining Theorem 2.1 and Fact 4.2 we get the following corollary.

Corollary 4.3. Let $k \in \mathbb{N}$ with $k \geq 2$. Also let $d, m, r \in \mathbb{N}$ with $d \geq m \geq 1$ and
$r \geq 1$. If $n \geq \mathrm{GR}(k, d, m, r)$, then for every $n$-dimensional Carlson-Simpson tree $W$
of $[k]^{<\mathbb{N}}$ and every $r$-coloring of the set $\mathrm{Subtr}^{\max}_m(W)$ there exists $U \in \mathrm{Subtr}^{\max}_d(W)$
such that the set $\mathrm{Subtr}^{\max}_m(U)$ is monochromatic.

The proof of Theorem 4.1 is based on a strengthening of Corollary 4.3. To state
it, it is convenient to introduce the following definition. For every $k, m, r \in \mathbb{N}$ with
$k \geq 2$ and $m, r \geq 1$ we define the function $g_{k,m,r} : \mathbb{N} \to \mathbb{N}$ by the rule $g_{k,m,r}(n) = 0$
if $n < m - 1$ and

(4.3) $g_{k,m,r}(n) = \mathrm{GR}(k, n + 1, m, r)$

if $n \geq m - 1$. We have the following lemma.

Lemma 4.4. Let $k, m, r \in \mathbb{N}$ with $k \geq 2$ and $m, r \geq 1$. Let $q, n \in \mathbb{N}$ with
$q \geq 1$ and $n \geq g^{(q)}_{k,m,r}(m)$. Then for every $n$-dimensional Carlson-Simpson tree $W$
of $[k]^{<\mathbb{N}}$ and every $r$-coloring of the set $\mathrm{Subtr}_m(W)$ there exists $U \in \mathrm{Subtr}^{\max}_{m+q}(W)$
with the following property. For every pair $S, T \in \mathrm{Subtr}_m(U)$ with $\mathrm{depth}_U(S) =
\mathrm{depth}_U(T)$ the Carlson-Simpson trees $S$ and $T$ have the same color.

Proof. We fix a coloring $c : \mathrm{Subtr}_m(W) \to [r]$. For every $i \in \{0, \dots, q\}$ we set
$n_i = g^{(q-i)}_{k,m,r}(m)$. Notice that, by (4.3), for every $i \in \{0, \dots, q - 1\}$ we have

(4.4) $n_i = \mathrm{GR}(k, n_{i+1} + 1, m, r) \geq n_{i+1} + 1 > n_q = m.$

We select $U_0 \in \mathrm{Subtr}^{\max}_{n_0}(W)$. By (4.4) and Corollary 4.3 we may construct a family
$\{U_1, \dots, U_q\}$ of Carlson-Simpson subtrees of $U_0$ with the following properties.

(a) For every $i \in [q]$ we have $\dim(U_i) = n_i + 1$.

(b) We have $U_1 \in \mathrm{Subtr}^{\max}_{n_1+1}(U_0)$. Moreover, if $q \geq 2$, then for every $i \in [q - 1]$
we have $U_{i+1} \in \mathrm{Subtr}^{\max}_{n_{i+1}+1}(U_i')$ where $U_i' = U_i \setminus U_i(n_i + 1)$.

(c) For every $i \in [q]$ the set $\mathrm{Subtr}^{\max}_m(U_i)$ is monochromatic with respect to $c$.

For every $i \in [q]$ let $(c_i, w^{(i)}_0, \dots, w^{(i)}_{n_i})$ be the generating sequence of $U_i$. We define
$U$ to be the Carlson-Simpson tree of $[k]^{<\mathbb{N}}$ generated by the sequence

(4.5) $(c_q, w^{(q)}_0, \dots, w^{(q)}_{n_q}, w^{(q-1)}_{n_{q-1}}, \dots, w^{(2)}_{n_2}, w^{(1)}_{n_1}).$

We will show that $U$ is as desired. Indeed, notice first that

(4.6) $\dim(U) = (n_q + 1) + (q - 1) = m + q.$

Also observe that $U \in \mathrm{Subtr}^{\max}_{m+q}(U_0)$ and so $U \in \mathrm{Subtr}^{\max}_{m+q}(W)$. Finally let $\ell \in [q]$
be arbitrary and set $i_\ell = q - \ell + 1 \in [q]$. By the definition of $U$ and (b) above, we see
that $U(m + \ell)$ is contained in $U_{i_\ell}(n_{i_\ell} + 1)$. Hence, for every pair $S, T \in \mathrm{Subtr}_m(U)$
with $\mathrm{depth}_U(S) = \mathrm{depth}_U(T) = m + \ell$ we have that $S, T \in \mathrm{Subtr}^{\max}_m(U_{i_\ell})$. Invoking
(c), we conclude that $c(S) = c(T)$ and the proof is completed. □

We are ready to proceed to the proof of Theorem 4.1.

Proof of Theorem 4.1. Let $k \in \mathbb{N}$ with $k \geq 2$. Also let $d, m, r \in \mathbb{N}$ with $d \geq m \geq 1$
and $r \geq 1$. We will show that

(4.7) $\mathrm{CS}(k, d, m, r) \leq g^{(d \cdot r - m)}_{k,m,r}(m).$

Indeed, let $n \geq g^{(d \cdot r - m)}_{k,m,r}(m)$ and $W$ be an arbitrary $n$-dimensional Carlson-Simpson
tree of $[k]^{<\mathbb{N}}$. We fix a coloring $c : \mathrm{Subtr}_m(W) \to [r]$. By Lemma 4.4 there
exists a Carlson-Simpson subtree $R$ of $W$ with $\dim(R) = d \cdot r$ such that for every
$S \in \mathrm{Subtr}_m(R)$ the color $c(S)$ of $S$ depends only on the depth of $S$ in $R$. Therefore,
by the classical pigeonhole principle, there exist a subset $I$ of $\{0, \dots, d \cdot r\}$ with
$|I| = d + 1$ and $r_0 \in [r]$ such that for every $i \in I$ and every $S \in \mathrm{Subtr}_m(R)$ with
$\mathrm{depth}_R(S) = i$ we have $c(S) = r_0$. Let $U$ be any $d$-dimensional Carlson-Simpson
subtree of $R$ which is contained in the set $\bigcup_{i \in I} R(i)$. By the previous discussion,
we see that the coloring $c$ restricted to $\mathrm{Subtr}_m(U)$ is constantly equal to $r_0$. The
proof of Theorem 4.1 is thus completed. □
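
The pigeonhole step in the proof above can be made concrete with a few lines of Python. The sketch assumes that the coloring has already been reduced to one color per depth, stored in a list `colors_by_depth` indexed by $\{0, \dots, d \cdot r\}$; depths that carry no $m$-dimensional subtree may receive any placeholder color. Both the function name and this input format are illustrative assumptions, not notation from the paper.

```python
from collections import defaultdict

def constant_depth_set(colors_by_depth, d):
    """Return (I, r0): the first d + 1 depths found to share a color r0.

    colors_by_depth[i] is assumed to be the common color of all subtrees
    of depth i (hypothetical input). Returns None when no color is used
    on d + 1 depths.
    """
    depths_of_color = defaultdict(list)
    for depth, color in enumerate(colors_by_depth):
        depths_of_color[color].append(depth)
        # stop as soon as some color has been seen on d + 1 depths
        if len(depths_of_color[color]) == d + 1:
            return depths_of_color[color], color
    return None
```

With $r$ colors and $d \cdot r + 1$ depths the function always succeeds, which is exactly the counting used to obtain the set $I$.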

5. The convolution operation 

The concatenation of two finite sequences provides us with a canonical way to
"glue" a pair of elements of $[k]^{<\mathbb{N}}$. Our goal in this section is to describe a different
"gluing" method which will be of fundamental importance throughout the paper.

The method is particularly easy to grasp for pairs of sequences of given length.
Specifically, let $n, m \geq 1$ and fix a subset $L$ of $\{0, \dots, n + m - 1\}$ of cardinality $n$.
Given an element $x$ of $[k]^n$ and an element $y$ of $[k]^m$, the outcome of the "gluing"
method for the pair $x, y$ is the unique element $z$ of $[k]^{n+m}$ which is "equal" to $x$ on
$L$ and to $y$ on the rest of the coordinates. This simple process can, of course, be
extended to arbitrary pairs of $[k]^{<\mathbb{N}}$. This is the content of the following definition.

Definition 5.1. Let $k \in \mathbb{N}$ with $k \geq 2$ and $L = \{l_0 < \dots < l_{|L|-1}\}$ be a nonempty
finite subset of $\mathbb{N}$. For every $i \in \{0, \dots, |L| - 1\}$ we set

(5.1) $L_i = \{l \in L : l < l_i\}$ and $\bar{L}_i = \{n \in \mathbb{N} : n < l_i \text{ and } n \notin L_i\}.$

Also let $n_L = \max(L) - |L| + 1$ and set

(5.2) $X_L = [k]^{n_L}.$

We define the convolution operation $c_L : [k]^{<|L|} \times X_L \to [k]^{<\mathbb{N}}$ associated to $L$ as
follows. For every $i \in \{0, \dots, |L| - 1\}$, every $t \in [k]^i$ and every $x \in X_L$ we set

(5.3) $c_L(t,x) = \big(I_{L_i}(t), I_{\bar{L}_i}(x \,|\, |\bar{L}_i|)\big)$

where $I_{L_i}$ and $I_{\bar{L}_i}$ are the canonical isomorphisms defined in §2.2.

More generally, let $V$ be a Carlson-Simpson tree of $[k]^{<\mathbb{N}}$ and assume that $L$ is
contained in $\{0, \dots, \dim(V)\}$. The convolution operation $c_{L,V} : [k]^{<|L|} \times X_L \to V$
associated to $(L, V)$ is defined by the rule

(5.4) $c_{L,V}(t,x) = I_V(c_L(t,x))$

where $I_V$ is the canonical isomorphism defined in §2.5.

Before we proceed let us give a specific example. Let $k = 5$ and $L = \{1, 3, 7, 9\}$,
and notice that $X_L = [5]^6$. In particular, the convolution operation $c_L$ associated
to the set $L$ is defined for pairs in $[5]^{<4} \times [5]^6$. Then for the pair $t = (1, 2)$ and
$x = (3, 5, 4, 2, 4, 1)$ we have

(5.5) $c_L(t,x) = (3, \mathbf{1}, 5, \mathbf{2}, 4, 2, 4)$

where in (5.5) we indicated with boldface letters the contribution of $t$.

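The example (5.5) can be reproduced mechanically. The following Python sketch implements the operation $c_L$ of (5.3) for words represented as tuples, under the reading that the letters of $t$ are placed on the coordinates in $L_i$ and an initial segment of $x$ fills the remaining coordinates below $l_i$. The function name `convolution` and the tuple representation are illustrative choices, not notation from the paper.

```python
def convolution(L, t, x):
    """Sketch of c_L(t, x) from (5.3) for words given as tuples.

    L is a nonempty finite set of naturals, t a word of length i < |L|
    and x a word of length n_L = max(L) - |L| + 1.
    """
    ls = sorted(L)
    i = len(t)
    if i >= len(ls):
        raise ValueError("t must have length < |L|")
    if len(x) != ls[-1] - len(ls) + 1:
        raise ValueError("x must have length max(L) - |L| + 1")
    l_i = ls[i]
    L_i = set(ls[:i])  # coordinates that receive the letters of t
    t_iter, x_iter = iter(t), iter(x)
    # Coordinates below l_i: those in L_i come from t, the rest from a prefix of x.
    return tuple(next(t_iter) if pos in L_i else next(x_iter) for pos in range(l_i))
```

Calling `convolution({1, 3, 7, 9}, (1, 2), (3, 5, 4, 2, 4, 1))` returns `(3, 1, 5, 2, 4, 2, 4)`, in agreement with (5.5).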
The rest of this section is devoted to the study of convolution operations. We 
notice that all properties described below follow by carefully manipulating the rele- 
vant definitions. In fact, once the basic definitions have been properly understood, 
most of the material of this section should be regarded as fairly straightforward. 

We begin with the following fact. 

Fact 5.2. Let $k \in \mathbb{N}$ with $k \geq 2$. Let $V$ be a Carlson-Simpson tree of $[k]^{<\mathbb{N}}$ and
$L = \{l_0 < \dots < l_{|L|-1}\}$ be a nonempty finite subset of $\{0, \dots, \dim(V)\}$. For every
$t \in [k]^{<|L|}$ we set

(5.6) $\Omega_t = \{c_{L,V}(t,x) : x \in X_L\}.$

Then for every $t, t' \in [k]^{<|L|}$ with $t \neq t'$ we have $\Omega_t \cap \Omega_{t'} = \emptyset$. Moreover, for every
$i \in \{0, \dots, |L| - 1\}$ the family $\{\Omega_t : t \in [k]^i\}$ forms an equipartition of $V(l_i)$.

Proof. By the definition of the convolution operation, we see that $\Omega_t \cap \Omega_{t'} = \emptyset$ if
$t \neq t'$. It is also easy to check that the family $\{\Omega_t : t \in [k]^i\}$ forms a partition
of $V(l_i)$. Therefore, to complete the proof it is enough to observe that for every
$i \in \{0, \dots, |L| - 1\}$ and every $t \in [k]^i$ we have

(5.7) $I_V^{-1}(\Omega_t) = \{x \in [k]^{l_i} : x|L_i = I_{L_i}(t)\}.$

Clearly this implies that $|\Omega_t| = |\Omega_{t'}|$ for every $t, t' \in [k]^i$. □
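
Fact 5.2 can be checked by brute force in small cases. The sketch below does this under a simplifying assumption that is ours and not the paper's: $V$ is taken to be the full tree of words of length at most $\max(L)$, so that $I_V$ is treated as the identity and $c_{L,V}$ reduces to $c_L$. All function names are illustrative.

```python
from itertools import product

def c_L(L, t, x):
    """c_L(t, x) as in (5.3): t fills the coordinates in L_i, x the others below l_i."""
    ls = sorted(L)
    L_i = set(ls[:len(t)])
    t_iter, x_iter = iter(t), iter(x)
    return tuple(next(t_iter) if p in L_i else next(x_iter) for p in range(ls[len(t)]))

def omega_cells(k, L, i):
    """Map each t in [k]^i to Omega_t = {c_L(t, x) : x in X_L}."""
    ls = sorted(L)
    n_L = ls[-1] - len(ls) + 1
    letters = range(1, k + 1)
    return {
        t: {c_L(L, t, x) for x in product(letters, repeat=n_L)}
        for t in product(letters, repeat=i)
    }

def is_equipartition(k, L, i):
    """Check that the cells Omega_t, t in [k]^i, equipartition [k]^(l_i)."""
    cells = list(omega_cells(k, L, i).values())
    l_i = sorted(L)[i]
    union = set().union(*cells)
    same_size = len({len(cell) for cell in cells}) == 1
    disjoint = sum(len(cell) for cell in cells) == len(union)
    covers = union == set(product(range(1, k + 1), repeat=l_i))
    return same_size and disjoint and covers
```

For example, `is_equipartition(2, {1, 3, 4}, i)` can be evaluated for each $i \in \{0, 1, 2\}$ to confirm the statement for this choice of $k$ and $L$.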

Using similar elementary observations we get the following. 

Fact 5.3. Let $k$, $V$ and $L$ be as in Fact 5.2. For every $t \in [k]^{<|L|}$ and every
$s \in [k]^{<\mathbb{N}}$ we set

(5.8) $Y^t_s = \{x \in X_L : c_{L,V}(t,x) = s\}.$

Then for every $t \in [k]^{<|L|}$ and every $s, s' \in \Omega_t$ with $s \neq s'$, where $\Omega_t$ is as in (5.6),
the sets $Y^t_s$ and $Y^t_{s'}$ are nonempty disjoint subsets of $X_L$. Moreover, the family
$\{Y^t_s : s \in \Omega_t\}$ forms an equipartition of $X_L$.

We will also need the following fact. 

Fact 5.4. Let $k$, $V$ and $L$ be as in Fact 5.2. For every $t \in [k]^{<|L|}$ and every
$s \in [k]^{<\mathbb{N}}$ let $\Omega_t$ and $Y^t_s$ be as in (5.6) and (5.8) respectively. Then for every
$i \in \{0, \dots, |L| - 1\}$ and every $t, t' \in [k]^i$ there exists a map $g_{t,t'} : \Omega_t \to \Omega_{t'}$ with the
following properties.

(i) For every $s \in \Omega_t$ we have that $Y^t_s = Y^{t'}_{g_{t,t'}(s)}$.

(ii) The map $g_{t,t'}$ is a bijection.

(iii) The map $g_{t,t'}$ preserves the lexicographical order.

(iv) If $t$ and $t'$ are $(r,r')$-equivalent for some $r, r' \in [k]$ with $r \neq r'$ (see §2.6),
then $s$ and $g_{t,t'}(s)$ are $(r,r')$-equivalent for every $s \in \Omega_t$.

Proof. For every $s \in \Omega_t$ we select $x_s \in Y^t_s$ and we set

(5.9) $g_{t,t'}(s) = c_{L,V}(t', x_s).$

It is easy to check that $g_{t,t'}$ is well-defined and satisfies the above properties. □

We proceed with the following lemma. 

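Lemma 5.5. Let $k \in \mathbb{N}$ with $k \geq 2$. Let $V$ be a Carlson-Simpson tree of $[k]^{<\mathbb{N}}$
and $L = \{l_0 < \dots < l_{|L|-1}\}$ be a nonempty finite subset of $\{0, \dots, \dim(V)\}$. Also let
$t \in [k]^{<|L|}$ and $A \subseteq [k]^{<\mathbb{N}}$ and set $B = c^{-1}_{L,V}(A)$. Then the following hold.

(i) We have $\mathrm{dens}_{\Omega_t}(A) = \mathrm{dens}_{\{t\} \times X_L}(B)$ where $\Omega_t$ is as in (5.6).

(ii) For every $i \in \{0, \dots, |L| - 1\}$ we have $\mathrm{dens}_{V(l_i)}(A) = \mathrm{dens}_{[k]^i \times X_L}(B)$.

Proof. For every $s \in \Omega_t$ let $Y^t_s$ be as in (5.8). By the definition of $B$ and $\Omega_t$,

(5.10) $B \cap (\{t\} \times X_L) = \{(t,x) : c_{L,V}(t,x) \in A \cap \Omega_t\}$
$= \bigcup_{s \in A \cap \Omega_t} \{(t,x) : c_{L,V}(t,x) = s\}$
$= \bigcup_{s \in A \cap \Omega_t} \{t\} \times Y^t_s.$

By Fact 5.3 for every $s \in \Omega_t$ we have

(5.11) $\dfrac{|Y^t_s|}{|X_L|} = \dfrac{1}{|\Omega_t|}.$

Therefore

(5.12) $\mathrm{dens}_{\{t\} \times X_L}(B) = \dfrac{|B \cap (\{t\} \times X_L)|}{|\{t\} \times X_L|} \overset{(5.10)}{=} \sum_{s \in A \cap \Omega_t} \dfrac{|Y^t_s|}{|X_L|} \overset{(5.11)}{=} \dfrac{|A \cap \Omega_t|}{|\Omega_t|} = \mathrm{dens}_{\Omega_t}(A).$

This completes the proof of the first part of the lemma. To see that part (ii) is
satisfied, let $i \in \{0, \dots, |L| - 1\}$ be arbitrary. By Fact 5.2 we see that

(5.13) $\mathrm{dens}_{V(l_i)}(A) = \mathbb{E}_{t \in [k]^i} \mathrm{dens}_{\Omega_t}(A).$

By (5.13) and the first part of the lemma, the result follows. □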

The final three lemmas of this section contain some coherence properties of 
convolution operations. The first one shows that convolution operations preserve 
Carlson-Simpson trees. 

Lemma 5.6. Let $k$, $V$ and $L$ be as in Lemma 5.5. Also let $W$ be a Carlson-Simpson
subtree of $[k]^{<|L|}$ and $x \in X_L$. Then, setting

(5.14) $W_x = \{c_{L,V}(w,x) : w \in W\},$

we have that $W_x$ is a Carlson-Simpson subtree of $V$ of dimension $\dim(W)$.
Moreover, for every $i \in \{0, \dots, \dim(W)\}$ we have

(5.15) $W_x(i) = \{c_{L,V}(w,x) : w \in W(i)\}.$

Proof. Let $\ell = |L|$ and $\{l_0 < \dots < l_{\ell-1}\}$ be the increasing enumeration of $L$. Set

(5.16) $J_0 = \{n \in \mathbb{N} : n < l_0\}$ and $J_i = \{n \in \mathbb{N} : l_{i-1} - i + 1 \leq n \leq l_i - i - 1\}$

for every $i \in [\ell - 1]$. Observe that the family $\{J_0, J_1, \dots, J_{\ell-1}\}$ forms a partition of
$\{0, \dots, l_{\ell-1} - \ell\}$ into successive intervals some of which are possibly empty. Let

(5.17) $c = I^{-1}_{J_0}(x|J_0).$

Also, for every $i \in \{0, \dots, \ell - 2\}$ we define

(5.18) $w_i(v) = v^\frown I^{-1}_{J_{i+1}}(x|J_{i+1}).$

Clearly $w_i$ is a left variable word over $k$ for every $i \in \{0, \dots, \ell - 2\}$. Let $S_x$ be the
Carlson-Simpson tree generated by the sequence $(c, w_0, \dots, w_{\ell-2})$ and observe that
$c_L(t,x) = I_{S_x}(t)$ for every $t \in [k]^{<|L|}$. Hence, for every $t \in [k]^{<|L|}$ we have

(5.19) $c_{L,V}(t,x) = I_V(I_{S_x}(t)).$

Using (5.19) and invoking the definition of $W_x$ in (5.14), the result follows. □
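
The construction in the proof of Lemma 5.6 can be traced on the example (5.5). The Python sketch below splits $x$ along the blocks $J_0, \dots, J_{\ell-1}$ of (5.16), builds the constant part and the left variable words, and evaluates them on $t$. It assumes that the canonical isomorphism of §2.5 sends $t = (t_0, \dots, t_{i-1})$ to $c^\frown w_0(t_0)^\frown \dots ^\frown w_{i-1}(t_{i-1})$; this reading, the marker `"v"` for the variable and all function names are our assumptions.

```python
def blocks(L):
    """The successive intervals J_0, ..., J_{l-1} of (5.16), as ranges."""
    ls = sorted(L)
    J = [range(0, ls[0])]
    for i in range(1, len(ls)):
        J.append(range(ls[i - 1] - i + 1, ls[i] - i))
    return J

def generating_sequence(L, x):
    """Constant part c and left variable words w_0, ..., w_{l-2} built from x."""
    J = blocks(L)
    c = tuple(x[n] for n in J[0])
    # each w_i starts with the variable "v" followed by the letters of x on J_{i+1}
    words = [("v",) + tuple(x[n] for n in J[i + 1]) for i in range(len(J) - 1)]
    return c, words

def evaluate(c, words, t):
    """Concatenate c, w_0(t_0), w_1(t_1), ... for the letters of t."""
    out = list(c)
    for w, a in zip(words, t):
        out.extend(a if letter == "v" else letter for letter in w)
    return tuple(out)
```

For $L = \{1, 3, 7, 9\}$ and $x = (3, 5, 4, 2, 4, 1)$ this gives $c = (3)$, $w_0 = (v, 5)$, $w_1 = (v, 4, 2, 4)$ and $w_2 = (v, 1)$, and evaluating at $t = (1, 2)$ recovers the word $(3, 1, 5, 2, 4, 2, 4)$ of (5.5).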

The next result enables us to transfer quantitative information from the space 
[fc] <N to the space on which the convolution operations are acting. 



Lemma 5.7. Let $k$, $V$ and $L$ be as in Lemma 5.5. Also let $W$ be a Carlson-Simpson
subtree of $[k]^{<|L|}$ and $x \in X_L$ and define $W_x$ as in (5.14). If $A$ is a subset of $[k]^{<\mathbb{N}}$
and $B = c^{-1}_{L,V}(A)$, then for every $i \in \{0, \dots, \dim(W)\}$ we have

(5.20) $\mathrm{dens}_{W_x(i)}(A) = \mathrm{dens}_{W(i) \times \{x\}}(B).$

Proof. We define the Carlson-Simpson tree $S_x$ exactly as we did in the proof of
Lemma 5.6. By (5.19), for every $t \in [k]^{<|L|}$ we have that $(t,x) \in B$ if and only if
$I_V(I_{S_x}(t)) \in A$ and the result follows. □

We close this section with the following lemma. 



Lemma 5.8. Let $k$, $V$ and $L$ be as in Lemma 5.5. Also let $W$ be a Carlson-Simpson
subtree of $[k]^{<|L|}$ and $A$ be a subset of $[k]^{<\mathbb{N}}$. Then for every $i \in \{0, \dots, \dim(W)\}$
we have

(5.21) $\mathrm{dens}_{c_{L,V}(W(i) \times X_L)}(A) = \mathbb{E}_{x \in X_L} \mathrm{dens}_{W_x(i)}(A)$

where $W_x$ is as in (5.14). In particular, for every $i \in \{0, \dots, |L| - 1\}$ we have

(5.22) $\mathrm{dens}_{V(l_i)}(A) = \mathbb{E}_{x \in X_L} \mathrm{dens}_{R_x(i)}(A)$

where $R_x = \{c_{L,V}(t,x) : t \in [k]^{<|L|}\}$ for every $x \in X_L$.

Proof. Let $i \in \{0, \dots, \dim(W)\}$ and set $B = c^{-1}_{L,V}(A)$. There exists a unique
$l \in \{0, \dots, |L| - 1\}$ such that $W(i)$ is contained in $[k]^l$. By Fact 5.2 the family
$\{\Omega_t : t \in W(i)\}$ forms an equipartition of $c_{L,V}(W(i) \times X_L)$. Therefore, by Lemma 5.5,

(5.23) $\mathrm{dens}_{c_{L,V}(W(i) \times X_L)}(A) = \mathbb{E}_{t \in W(i)} \mathrm{dens}_{\Omega_t}(A)$
$= \mathbb{E}_{t \in W(i)} \mathrm{dens}_{\{t\} \times X_L}(B)$
$= \mathrm{dens}_{W(i) \times X_L}(B)$
$= \mathbb{E}_{x \in X_L} \mathrm{dens}_{W(i) \times \{x\}}(B)$
$\overset{(5.20)}{=} \mathbb{E}_{x \in X_L} \mathrm{dens}_{W_x(i)}(A).$

Finally notice that $V(l_i) = c_{L,V}([k]^i \times X_L)$ for every $i \in \{0, \dots, |L| - 1\}$. Therefore,
equality (5.22) follows by (5.21) and the proof is completed. □

6. Iterated convolutions 

Our goal in this section is to study iterations of convolution operations. We 
remark that this material will be used only in §9. The exact statements that we 
need are isolated in §6.2. 

6.1. Definitions and basic properties. We start with the following definition. 

Definition 6.1. Let $\mathbf{L} = (L_n)_{n=0}^d$ be a finite sequence of nonempty finite subsets
of $\mathbb{N}$. Also let $k \in \mathbb{N}$ with $k \geq 2$ and $\mathbf{V} = (V_n)_{n=0}^d$ be a finite sequence of Carlson-Simpson
trees of $[k]^{<\mathbb{N}}$ with the same length as $\mathbf{L}$. We say that the pair $(\mathbf{L}, \mathbf{V})$
is $k$-compatible, or simply compatible if $k$ is understood, provided that for every
$n \in \{0, \dots, d\}$ we have $L_n \subseteq \{0, \dots, \dim(V_n)\}$ and, if $n < d$, then $V_{n+1} \subseteq [k]^{<|L_n|}$.

Notice that if $(\mathbf{L}, \mathbf{V})$ is a compatible pair and $\mathbf{L}', \mathbf{V}'$ are initial subsequences of
$\mathbf{L}, \mathbf{V}$ with common length, then the pair $(\mathbf{L}', \mathbf{V}')$ is also compatible. Also observe
that for every compatible pair $(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ and every $n \in \{0, \dots, d\}$
we can define the convolution operation $c_{L_n,V_n} : [k]^{<|L_n|} \times X_{L_n} \to V_n$ associated to
$(L_n, V_n)$ as described in §5. What Definition 6.1 guarantees is that for compatible
pairs we can iterate these operations. This is the content of the following definition.

Definition 6.2. Let $k \in \mathbb{N}$ with $k \geq 2$ and $(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ be a
$k$-compatible pair. We set

(6.1) $X_{\mathbf{L}} = \prod_{n=0}^{d} X_{L_n}.$

By recursion on $d$ we define the iterated convolution operation

(6.2) $c_{\mathbf{L},\mathbf{V}} : [k]^{<|L_d|} \times X_{\mathbf{L}} \to V_0$

associated to $(\mathbf{L}, \mathbf{V})$ as follows. For $d = 0$ this is the $c_{L_0,V_0}$ convolution operation
defined in (5.4). If $d \geq 1$, then let $\mathbf{L}' = (L_n)_{n=0}^{d-1}$ and $\mathbf{V}' = (V_n)_{n=0}^{d-1}$ and assume that the operation
$c_{\mathbf{L}',\mathbf{V}'}$ has been defined. We set

(6.3) $c_{\mathbf{L},\mathbf{V}}(s, x_0, \dots, x_d) = c_{\mathbf{L}',\mathbf{V}'}(c_{L_d,V_d}(s, x_d), x_0, \dots, x_{d-1})$

for every $s \in [k]^{<|L_d|}$ and every $(x_0, \dots, x_d) \in X_{\mathbf{L}}$. In this case, the quotient map

(6.4) $q_{\mathbf{L},\mathbf{V}} : [k]^{<|L_d|} \times X_{\mathbf{L}} \to [k]^{<|L_{d-1}|} \times X_{\mathbf{L}'}$

associated to $(\mathbf{L}, \mathbf{V})$ is defined by the rule

(6.5) $q_{\mathbf{L},\mathbf{V}}(t, \mathbf{x}, x) = (c_{L_d,V_d}(t, x), \mathbf{x})$

for every $t \in [k]^{<|L_d|}$ and every $(\mathbf{x}, x) \in X_{\mathbf{L}'} \times X_{L_d}$.

The rest of this subsection is devoted to several lemmas establishing proper- 
ties of iterated convolutions. Just as in §5, all this material follows by carefully 
manipulating the relevant definitions. We begin with the following elementary fact. 

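Fact 6.3. Let $k \geq 2$ and $(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ be a $k$-compatible pair. If
$d \geq 1$ and $(\mathbf{L}', \mathbf{V}') = ((L_n)_{n=0}^{d-1}, (V_n)_{n=0}^{d-1})$, then $c_{\mathbf{L},\mathbf{V}} = c_{\mathbf{L}',\mathbf{V}'} \circ q_{\mathbf{L},\mathbf{V}}$.

The next two lemmas are multidimensional analogues of Lemmas 5.6 and 5.7.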

Lemma 6.4. Let $k \geq 2$ and $(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ be a $k$-compatible pair.
Also let $W$ be a Carlson-Simpson subtree of $[k]^{<|L_d|}$ and $\mathbf{x} \in X_{\mathbf{L}}$. Then the set

(6.6) $W_{\mathbf{x}} = \{c_{\mathbf{L},\mathbf{V}}(w, \mathbf{x}) : w \in W\}$

is a Carlson-Simpson subtree of $V_0$ with the same dimension as $W$. Moreover, for
every $i \in \{0, \dots, \dim(W)\}$ we have

(6.7) $W_{\mathbf{x}}(i) = \{c_{\mathbf{L},\mathbf{V}}(w, \mathbf{x}) : w \in W(i)\}.$

Proof. Both assertions are proved by induction on $d$ using similar arguments.
We will give the details only for the first one. The case "$d = 0$" is the content
of Lemma 5.6. So, let $d \geq 1$ and assume that the result has been proved up to
$d - 1$. Fix a compatible pair $(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ and let $W$ and $\mathbf{x}$ be as
in the statement of the lemma. Write $\mathbf{x} = (x_0, \dots, x_d)$ and set $\mathbf{x}' = (x_0, \dots, x_{d-1})$
and $S = \{c_{L_d,V_d}(w, x_d) : w \in W\}$. Also let $\mathbf{L}' = (L_n)_{n=0}^{d-1}$ and $\mathbf{V}' = (V_n)_{n=0}^{d-1}$
and observe that the pair $(\mathbf{L}', \mathbf{V}')$ is compatible. By Lemma 5.6, $S$ is a Carlson-Simpson
subtree of $V_d$ with $\dim(S) = \dim(W)$. By Definition 6.1 we have that $S$ is
contained in $[k]^{<|L_{d-1}|}$. Therefore, applying the inductive assumptions for the pair
$(\mathbf{L}', \mathbf{V}')$, we see that $S_{\mathbf{x}'}$ is a Carlson-Simpson subtree of $V_0$ of dimension $\dim(W)$.
Noticing that $S_{\mathbf{x}'}$ coincides with $W_{\mathbf{x}}$ the result follows. □

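Lemma 6.5. Let $k$, $\mathbf{L}$, $\mathbf{V}$, $W$ and $\mathbf{x}$ be as in Lemma 6.4. Also let $A$ be a subset of
$[k]^{<\mathbb{N}}$ and set $B = c^{-1}_{\mathbf{L},\mathbf{V}}(A)$. Then for every $i \in \{0, \dots, \dim(W)\}$ we have

(6.8) $\mathrm{dens}_{W(i) \times \{\mathbf{x}\}}(B) = \mathrm{dens}_{W_{\mathbf{x}}(i)}(A).$

Proof. By induction on $d$. The case "$d = 0$" follows from Lemma 5.7. Let $d \geq 1$
and assume that the result has been proved up to $d - 1$. Fix a $k$-compatible pair
$(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ and let $W$, $\mathbf{x}$, $A$ and $B$ be as in the statement of the
lemma. Write $\mathbf{x} = (x_0, \dots, x_d)$ and define $\mathbf{x}'$, $S$, $\mathbf{L}'$ and $\mathbf{V}'$ precisely as in the proof
of Lemma 6.4. We set $C = c^{-1}_{\mathbf{L}',\mathbf{V}'}(A)$. For every $\mathbf{y} \in X_{\mathbf{L}'}$ let $C_{\mathbf{y}}$ and $B_{\mathbf{y}}$ be the
sections of $C$ and $B$ at $\mathbf{y}$. By Fact 6.3 we see that $B_{\mathbf{y}} = c^{-1}_{L_d,V_d}(C_{\mathbf{y}})$. Hence,

(6.9) $\mathrm{dens}_{W(i) \times \{\mathbf{x}\}}(B) = \mathrm{dens}_{W(i) \times \{\mathbf{x}'\} \times \{x_d\}}(B)$
$= \mathrm{dens}_{W(i) \times \{\mathbf{x}'\} \times \{x_d\}}(B_{\mathbf{x}'} \times \{\mathbf{x}'\})$
$= \mathrm{dens}_{W(i) \times \{x_d\}}(B_{\mathbf{x}'})$
$= \mathrm{dens}_{W(i) \times \{x_d\}}(c^{-1}_{L_d,V_d}(C_{\mathbf{x}'})).$

Invoking Lemma 5.7 we have

(6.10) $\mathrm{dens}_{W(i) \times \{x_d\}}(c^{-1}_{L_d,V_d}(C_{\mathbf{x}'})) = \mathrm{dens}_{S(i)}(C_{\mathbf{x}'}).$

Next observe that

(6.11) $\mathrm{dens}_{S(i)}(C_{\mathbf{x}'}) = \mathrm{dens}_{S(i) \times \{\mathbf{x}'\}}(C_{\mathbf{x}'} \times \{\mathbf{x}'\}) = \mathrm{dens}_{S(i) \times \{\mathbf{x}'\}}(C).$

As we have already pointed out in the proof of Lemma 6.4 the set $S_{\mathbf{x}'}$ coincides
with $W_{\mathbf{x}}$. Since $C = c^{-1}_{\mathbf{L}',\mathbf{V}'}(A)$, we may apply our inductive hypothesis to the
Carlson-Simpson tree $S$, the element $\mathbf{x}'$ and the set $A$ to infer that

(6.12) $\mathrm{dens}_{S(i) \times \{\mathbf{x}'\}}(C) = \mathrm{dens}_{S_{\mathbf{x}'}(i)}(A) = \mathrm{dens}_{W_{\mathbf{x}}(i)}(A).$

Combining equalities (6.9) up to (6.12) the result follows. □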

We proceed with the following lemma. 

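Lemma 6.6. Let $k \geq 2$ and $(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ be a $k$-compatible pair.
Assume that $d \geq 1$ and let $\mathbf{L}' = (L_n)_{n=0}^{d-1}$ and $\mathbf{V}' = (V_n)_{n=0}^{d-1}$. Let $C$ be a subset of
$[k]^{<|L_{d-1}|} \times X_{\mathbf{L}'}$ and set $B = q^{-1}_{\mathbf{L},\mathbf{V}}(C)$. Finally let $t \in [k]^{<|L_d|}$ and set

(6.13) $\Omega_t = \{c_{L_d,V_d}(t,x) : x \in X_{L_d}\}.$

Then $q^{-1}_{\mathbf{L},\mathbf{V}}(\Omega_t \times X_{\mathbf{L}'}) = \{t\} \times X_{\mathbf{L}}$ and

(6.14) $\mathrm{dens}_{\Omega_t \times X_{\mathbf{L}'}}(C) = \mathrm{dens}_{\{t\} \times X_{\mathbf{L}}}(B).$

Proof. By the definition of the quotient map $q_{\mathbf{L},\mathbf{V}}$ in (6.5), we have

(6.15) $q^{-1}_{\mathbf{L},\mathbf{V}}(\Omega_t \times X_{\mathbf{L}'}) = c^{-1}_{L_d,V_d}(\Omega_t) \times X_{\mathbf{L}'}.$

Since $c^{-1}_{L_d,V_d}(\Omega_t) = \{t\} \times X_{L_d}$, by (6.15), we see that $q^{-1}_{\mathbf{L},\mathbf{V}}(\Omega_t \times X_{\mathbf{L}'}) = \{t\} \times X_{\mathbf{L}}$.

Now for every $\mathbf{x}' \in X_{\mathbf{L}'}$ let $C_{\mathbf{x}'}$ and $B_{\mathbf{x}'}$ be the sections of $C$ and $B$ at $\mathbf{x}'$
respectively. Observe that $C_{\mathbf{x}'} \subseteq [k]^{<|L_{d-1}|}$ and $B_{\mathbf{x}'} \subseteq [k]^{<|L_d|} \times X_{L_d}$. Also notice
that $B_{\mathbf{x}'} = c^{-1}_{L_d,V_d}(C_{\mathbf{x}'})$ for every $\mathbf{x}' \in X_{\mathbf{L}'}$. Hence, by Lemma 5.5,

(6.16) $\mathrm{dens}_{\{t\} \times X_{L_d}}(B_{\mathbf{x}'}) = \mathrm{dens}_{\Omega_t}(C_{\mathbf{x}'})$

for every $\mathbf{x}' \in X_{\mathbf{L}'}$. Therefore,

(6.17) $\mathrm{dens}_{\{t\} \times X_{\mathbf{L}}}(B) = \mathbb{E}_{\mathbf{x}' \in X_{\mathbf{L}'}} \mathrm{dens}_{\{t\} \times X_{L_d}}(B_{\mathbf{x}'}) \overset{(6.16)}{=} \mathbb{E}_{\mathbf{x}' \in X_{\mathbf{L}'}} \mathrm{dens}_{\Omega_t}(C_{\mathbf{x}'}) = \mathrm{dens}_{\Omega_t \times X_{\mathbf{L}'}}(C)$

as desired. □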

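We close this subsection with the following consequence of Lemma 6.6.

Corollary 6.7. Let $k \geq 2$ and $(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ be a $k$-compatible pair.
Assume that $d \geq 1$ and let $\mathbf{L}' = (L_n)_{n=0}^{d-1}$ and $\mathbf{V}' = (V_n)_{n=0}^{d-1}$. Let $A$ be a subset of
$[k]^{<\mathbb{N}}$ and set $A^d = c^{-1}_{\mathbf{L},\mathbf{V}}(A)$ and $A^{d-1} = c^{-1}_{\mathbf{L}',\mathbf{V}'}(A)$. Then for every $t \in [k]^{<|L_d|}$

(6.18) $\mathrm{dens}_{X_{\mathbf{L}}}(A^d_t) = \mathbb{E}_{s \in \Omega_t} \mathrm{dens}_{X_{\mathbf{L}'}}(A^{d-1}_s)$

where $A^d_t$ is the section of $A^d$ at $t$, $\Omega_t \subseteq V_d \subseteq [k]^{<|L_{d-1}|}$ is as in (6.13) and $A^{d-1}_s$
is the section of $A^{d-1}$ at $s$.

Proof. We fix $t \in [k]^{<|L_d|}$. Notice that

(6.19) $A^d = c^{-1}_{\mathbf{L},\mathbf{V}}(A) = q^{-1}_{\mathbf{L},\mathbf{V}}(c^{-1}_{\mathbf{L}',\mathbf{V}'}(A)) = q^{-1}_{\mathbf{L},\mathbf{V}}(A^{d-1}).$

By (6.19) and Lemma 6.6 applied to the sets "$B = A^d$" and "$C = A^{d-1}$" we obtain

(6.20) $\mathrm{dens}_{X_{\mathbf{L}}}(A^d_t) = \mathrm{dens}_{\{t\} \times X_{\mathbf{L}}}(A^d) \overset{(6.14)}{=} \mathrm{dens}_{\Omega_t \times X_{\mathbf{L}'}}(A^{d-1}) = \mathbb{E}_{s \in \Omega_t} \mathrm{dens}_{X_{\mathbf{L}'}}(A^{d-1}_s)$

and the proof is completed. □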

6.2. Consequences. As we have already mentioned, in this subsection we will
collect some results which will be of particular importance in §9. The first two of
them follow by repeated applications of Corollary 6.7. The details are left to the
reader.

Corollary 6.8. Let $k \geq 2$ and $(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ be a $k$-compatible
pair. Let $0 < \gamma \leq 1$ and $A$ be a subset of $[k]^{<\mathbb{N}}$. We set $A^d = c^{-1}_{\mathbf{L},\mathbf{V}}(A)$ and
$A^0 = c^{-1}_{L_0,V_0}(A)$. Suppose that $\mathrm{dens}_{X_{L_0}}(A^0_s) \geq \gamma$ for every $s \in [k]^{<|L_0|}$ where $A^0_s$
is the section of $A^0$ at $s$. If $t \in [k]^{<|L_d|}$ and $A^d_t$ is the section of $A^d$ at $t$, then
$\mathrm{dens}_{X_{\mathbf{L}}}(A^d_t) \geq \gamma$.

Corollary 6.9. Let $k \geq 2$ and $(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ be a $k$-compatible
pair. Let $0 < \lambda \leq 1$ and $A$ and $B$ be two subsets of $[k]^{<\mathbb{N}}$ with $A \subseteq B$. We set
$A^d = c^{-1}_{\mathbf{L},\mathbf{V}}(A)$, $A^0 = c^{-1}_{L_0,V_0}(A)$, $B^d = c^{-1}_{\mathbf{L},\mathbf{V}}(B)$ and $B^0 = c^{-1}_{L_0,V_0}(B)$. Suppose that
$\mathrm{dens}_{X_{L_0}}(A^0_s) \geq \lambda \cdot \mathrm{dens}_{X_{L_0}}(B^0_s)$ for every $s \in [k]^{<|L_0|}$ where $A^0_s$ and $B^0_s$ are the
sections of $A^0$ and $B^0$ at $s$. If $t \in [k]^{<|L_d|}$, then $\mathrm{dens}_{X_{\mathbf{L}}}(A^d_t) \geq \lambda \cdot \mathrm{dens}_{X_{\mathbf{L}}}(B^d_t)$
where $A^d_t$ and $B^d_t$ are the sections of $A^d$ and $B^d$ at $t$.



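The final result is a consequence of Lemma 6.6.

Corollary 6.10. Let $k \geq 2$ and $(\mathbf{L}, \mathbf{V}) = ((L_n)_{n=0}^d, (V_n)_{n=0}^d)$ be a $k$-compatible
pair. Assume that $d \geq 1$ and let $\mathbf{L}' = (L_n)_{n=0}^{d-1}$ and $\mathbf{V}' = (V_n)_{n=0}^{d-1}$. Let $t \in [k]^{<|L_d|}$
and $C_0$ be a nonempty subset of $\Omega_t \times X_{\mathbf{L}'}$ where $\Omega_t$ is as in (6.13). Also let
$C_1 \subseteq [k]^{<|L_{d-1}|} \times X_{\mathbf{L}'}$. Then, setting $B_i = q^{-1}_{\mathbf{L},\mathbf{V}}(C_i)$ for every $i \in \{0, 1\}$, we have
that $\mathrm{dens}_{C_0}(C_1) = \mathrm{dens}_{B_0}(B_1)$.

Proof. Notice that $q^{-1}_{\mathbf{L},\mathbf{V}}(C_0 \cap C_1) = B_0 \cap B_1$. Since $q^{-1}_{\mathbf{L},\mathbf{V}}(\Omega_t \times X_{\mathbf{L}'}) = \{t\} \times X_{\mathbf{L}}$,
we see that $B_0 \subseteq \{t\} \times X_{\mathbf{L}}$. Therefore,

(6.21) $\mathrm{dens}_{C_0}(C_1) = \dfrac{|C_0 \cap C_1|}{|C_0|} = \dfrac{\mathrm{dens}_{\Omega_t \times X_{\mathbf{L}'}}(C_0 \cap C_1)}{\mathrm{dens}_{\Omega_t \times X_{\mathbf{L}'}}(C_0)} \overset{(6.14)}{=} \dfrac{\mathrm{dens}_{\{t\} \times X_{\mathbf{L}}}(B_0 \cap B_1)}{\mathrm{dens}_{\{t\} \times X_{\mathbf{L}}}(B_0)} = \dfrac{|B_0 \cap B_1|}{|B_0|} = \mathrm{dens}_{B_0}(B_1)$

as desired. □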

7. Preliminary tools for the proof of Theorem B 

In this section we will gather some results that are part of the proof of Theorem B
but are not directly related to the main argument. Specifically, in §7.1 we prove the
first instance of Theorem B which can be seen as a variant of the classical Sperner
Theorem [35]. In §7.2 we show how one can estimate the number $\mathrm{DCS}(k, m + 1, \delta)$
assuming that the numbers $\mathrm{DCS}(k, m, \beta)$ have been defined for every $0 < \beta \leq 1$.
This result is part of an inductive scheme that we will discuss in detail in §8.1.
Finally, in §7.3 we present some consequences.

7.1. Estimating the numbers $\mathrm{DCS}(2, 1, \delta)$. We have the following proposition.

Proposition 7.1. Let $0 < \delta \leq 1$. Also let $A$ be a subset of $[2]^{<\mathbb{N}}$ and $N$ be a finite
subset of $\mathbb{N}$ such that

(7.1) $|N| \geq \mathrm{Reg}(2, \mathrm{CS}(2, \lceil 17\delta^{-2} \rceil, 1, 2) + 1, 1, \delta/4).$

If $|A \cap [2]^n| \geq \delta 2^n$ for every $n \in N$, then there exists a Carlson-Simpson line $R$ of
$[2]^{<\mathbb{N}}$ which is contained in $A$. In particular,

(7.2) $\mathrm{DCS}(2, 1, \delta) \leq \mathrm{Reg}(2, \mathrm{CS}(2, \lceil 17\delta^{-2} \rceil, 1, 2) + 1, 1, \delta/4).$

We should point out that the estimate for the numbers $\mathrm{DCS}(2, 1, \delta)$ obtained by
Proposition 7.1 is rather weak and far from being optimal. However, the proof of
Proposition 7.1 is conceptually close to the proof of the general case of Theorem B,
and as such, should serve as a motivating introduction to the main argument.

We start with the following lemma. 



Lemma 7.2. Let $A$ and $N$ be as in Proposition 7.1. Then there exists $L \subseteq N$ with

(7.3) $|L| = \mathrm{CS}(2, \lceil 17\delta^{-2} \rceil, 1, 2) + 1$

and satisfying the following property. Let $c_L : [2]^{<|L|} \times X_L \to [2]^{<\mathbb{N}}$ be the convolution
operation associated to $L$ and set $B = c_L^{-1}(A)$. Then for every $t \in [2]^{<|L|}$ we
have $\mathrm{dens}(B_t) \geq 3\delta/4$ where $B_t = \{x \in X_L : (t,x) \in B\}$ is the section of $B$ at $t$.

Proof. By Lemma 3.2 and our assumptions on the size of the set $N$, there exists a
subset $L$ of $N$ with $|L| = \mathrm{CS}(2, \lceil 17\delta^{-2} \rceil, 1, 2) + 1$ and such that the family $\mathcal{F} := \{A\}$
is $(\delta/4, L)$-regular. Write the set $L$ in increasing order as $\{l_0 < \dots < l_{|L|-1}\}$ and
for every $i \in \{0, \dots, |L| - 1\}$ let $L_i$ and $\bar{L}_i$ be as in (5.1). Since $|A \cap [2]^n| \geq \delta 2^n$ for
every $n \in L$ and the singleton $\{A\}$ is $(\delta/4, L)$-regular, we see that

(7.4) $\mathrm{dens}(\{w \in [2]^{\bar{L}_i} : (y,w) \in A \cap [2]^{l_i}\}) \geq 3\delta/4$

for every $i \in \{0, \dots, |L| - 1\}$ and every $y \in [2]^{L_i}$. By the definition of the convolution
operation in (5.3), for every $i \in \{0, \dots, |L| - 1\}$ and every $t \in [2]^i$ we have

(7.5) $\Omega_t := \{c_L(t,x) : x \in X_L\} = \{z \in [2]^{l_i} : z|L_i = I_{L_i}(t)\}.$

Thus, by (7.4) applied to "$y = I_{L_i}(t)$", we obtain

(7.6) $\mathrm{dens}_{\Omega_t}(A) = \mathrm{dens}(\{w \in [2]^{\bar{L}_i} : (I_{L_i}(t), w) \in A \cap [2]^{l_i}\}) \geq 3\delta/4.$

Finally, by Lemma 5.5 we have

(7.7) $\mathrm{dens}_{\Omega_t}(A) = \mathrm{dens}_{\{t\} \times X_L}(B) = \mathrm{dens}_{X_L}(B_t).$

Combining (7.6) and (7.7) the result follows. □

For the next step of the proof of Proposition 7.1 we need to introduce some
terminology. Specifically, let $W$ be an $m$-dimensional Carlson-Simpson tree of
$[2]^{<\mathbb{N}}$ and $(c, w_0, \dots, w_{m-1})$ be its generating sequence. Also let $t, t' \in W$. We say
that $t'$ is a successor of $t$ in $W$ if there exist $i, j \in \{0, \dots, m - 1\}$ with $i \leq j$ as well as
$a_i, \dots, a_j \in [2]$ such that $t' = t^\frown w_i(a_i)^\frown \dots ^\frown w_j(a_j)$. If, in addition, we have $a_i = 1$,
then we say that $t'$ is a left successor of $t$ in $W$.

Lemma 7.3. Let $L$ and $\{B_t : t \in [2]^{<|L|}\}$ be as in Lemma 7.2. Then there exists
a Carlson-Simpson subtree $W$ of $[2]^{<|L|}$ with $\dim(W) = \lceil 17\delta^{-2} \rceil$ and such that

(7.8) $\mathrm{dens}(B_t \cap B_{t'}) \geq \delta^2/16$

for every $t, t' \in W$ with $t'$ a left successor of $t$ in $W$.

Proof. For every Carlson-Simpson line $S$ of $[2]^{<\mathbb{N}}$ let us denote by $(c_S, w_S)$ its
generating sequence. We set

(7.9) $\mathcal{L} = \{S \in \mathrm{Subtr}_1([2]^{<|L|}) : \mathrm{dens}(B_{c_S} \cap B_{c_S^\frown w_S(1)}) \geq \delta^2/16\}.$

By Theorem 4.1 and (7.3), there exists a Carlson-Simpson subtree $W$ of $[2]^{<|L|}$ with
$\dim(W) = \lceil 17\delta^{-2} \rceil$ such that either $\mathrm{Subtr}_1(W) \subseteq \mathcal{L}$ or $\mathrm{Subtr}_1(W) \cap \mathcal{L} = \emptyset$. Observe
that for every $t, t' \in W$ we have that $t'$ is a left successor of $t$ in $W$ if and only if there
exists a Carlson-Simpson line $S$ of $W$ such that $t = c_S$ and $t' = c_S^\frown w_S(1)$. Therefore,
the proof will be completed once we show that $\mathrm{Subtr}_1(W) \cap \mathcal{L} \neq \emptyset$. To this end we
argue as follows. Let $d = \dim(W) = \lceil 17\delta^{-2} \rceil$ and $(c, w_0, \dots, w_{d-1})$ be the generating
sequence of $W$. We set $t_0 = c$ and $t_i = c^\frown w_0(1)^\frown \dots ^\frown w_{i-1}(1)$ for every $i \in [d]$. By
our assumptions, we have $\mathrm{dens}(B_{t_i}) \geq 3\delta/4$ for every $i \in \{0, \dots, d\}$. Hence, by
Lemma [2Jl applied for "$\varepsilon = 3\delta/4$" and "$\theta = \delta/4$", there exist $i, j \in \{0, \dots, d\}$ with
$i < j$ and such that $\mathrm{dens}(B_{t_i} \cap B_{t_j}) \geq \delta^2/16$. If $R$ is the unique Carlson-Simpson
line of $W$ with $c_R = t_i$ and $w_R = w_i^\frown \dots ^\frown w_{j-1}$, then the previous discussion implies
that $R \in \mathcal{L}$, as desired. □



The following lemma is the last step towards the proof of Proposition 17.1 



Lemma 7.4. Let W be the Carlson-Simpson tree obtained by Lemma \7.3\ Then 
W contains a Carlson-Simpson line S such that 



(7-10) P| B t + 



Proof. As in Lemma 7.3, let $d = \dim(W) = \lceil 17\delta^{-2} \rceil$ and $(c, w_0, \dots, w_{d-1})$ be the
generating sequence of $W$. For every $i \in \{0, \dots, d-2\}$ we set

(7.11) $t_i = c^\frown w_0(2)^\frown \dots ^\frown w_i(2)$ and $s_i = t_i^\frown w_{i+1}(1)^\frown \dots ^\frown w_{d-1}(1)$.

Observe that $s_i$ is a left successor of $t_i$ in $W$. Therefore, by Lemma 7.3, setting
$C_i = B_{t_i} \cap B_{s_i}$ we have $\mathrm{dens}(C_i) \geq \delta^2/16$ for every $i \in \{0, \dots, d-2\}$. Also let
$s_{d-1} = c^\frown w_0(2)^\frown \dots ^\frown w_{d-1}(2)$ and $C_{d-1} = B_{s_{d-1}}$ and notice that, by Lemma 7.2,
we have $\mathrm{dens}(C_{d-1}) \geq 3\delta/4 \geq \delta^2/16$. Since $d > 16/\delta^2$ there exist $0 \leq i < j \leq d-1$
such that $C_i \cap C_j \neq \emptyset$. We define

(7.12) $c' = t_i$ and $w' = w_{i+1}^\frown \dots ^\frown w_j^\frown y$

where $y = w_{j+1}(1)^\frown \dots ^\frown w_{d-1}(1)$ if $j < d-1$ and $y = \emptyset$ otherwise. Let $S$ be the
Carlson-Simpson line of $W$ generated by the sequence $(c', w')$ and observe that
$S = \{t_i\} \cup \{s_i, s_j\}$. Hence,

(7.13) $\bigcap_{t \in S} B_t \supseteq C_i \cap C_j \neq \emptyset$

and the proof is completed. □
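The counting step "since $d > 16/\delta^2$ there exist $i < j$ with $C_i \cap C_j \neq \emptyset$" is the pigeonhole principle for densities: more than $1/\alpha$ pairwise disjoint sets of density at least $\alpha$ cannot fit in one probability space (here $\alpha = \delta^2/16$). The sketch below illustrates this on a finite uniform sample space; the function names and the finite model are assumptions made for the illustration.

```python
# Illustrative sketch only; the finite uniform model and names are assumptions.
from fractions import Fraction
from itertools import combinations


def find_intersecting_pair(sets):
    """Return indices (i, j), i < j, with sets[i] & sets[j] nonempty, or None."""
    for (i, a), (j, b) in combinations(enumerate(sets), 2):
        if a & b:
            return i, j
    return None


def must_intersect(sets, universe_size, alpha):
    """True when the hypothesis of the counting argument holds: every set has
    density >= alpha in a universe of the given size and len(sets) > 1/alpha."""
    dense = all(Fraction(len(s), universe_size) >= alpha for s in sets)
    return dense and len(sets) * alpha > 1
```

Whenever `must_intersect` returns `True`, the search in `find_intersecting_pair` is guaranteed to succeed, since disjoint sets would have total density exceeding 1.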



We are ready to proceed to the proof of Proposition 7.1.

Proof of Proposition 7.1. Let $S$ be the Carlson-Simpson line obtained by Lemma
7.4. We select $x_0 \in X_L$ such that $x_0 \in B_t$ for every $t \in S$ and we set

(7.14) $R = \{c_L(t, x_0) : t \in S\}$.

By Lemma 5.6, we have that $R$ is a Carlson-Simpson line of $[2]^{<\mathbb{N}}$. Next, recall that
$B = c_L^{-1}(A)$. Since $(t, x_0) \in B$ for every $t \in S$, we conclude that $R$ is contained in
$A$, as desired. □

7.2. Estimating the numbers $\mathrm{DCS}(k, m+1, \delta)$. Let $k, m \in \mathbb{N}$ with $k \geq 2$
and $m \geq 1$ and assume that for every $0 < \beta \leq 1$ the number $\mathrm{DCS}(k, m, \beta)$ has
been defined. This assumption, of course, implies that for every $\ell \in [m]$ and every
$0 < \beta \leq 1$ the number $\mathrm{DCS}(k, \ell, \beta)$ has been defined. Therefore, for every $\ell \in [m]$
and every $0 < \delta \leq 1$ we may set

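(7.15) $\Lambda(k, \ell, \delta) = \lceil \delta^{-1} \mathrm{DCS}(k, \ell, \delta) \rceil$
and

(7.16) $\Theta(k, \ell, \delta) = \dfrac{2\delta}{|\mathrm{Subtr}_\ell([k]^{<\Lambda(k, \ell, \delta)})|}$.

Moreover, let

(7.17) $\Lambda_0 = \Lambda_0(k, \delta) = \Lambda(k, 1, \delta^2/16)$ and $\Theta_0 = \Theta_0(k, \delta) = \Theta(k, 1, \delta^2/16)$
and define $h_\delta : \mathbb{N} \to \mathbb{N}$ by the rule

(7.18) $h_\delta(n) = \Lambda_0 + \lceil 2\Theta_0^{-1} n \rceil$.

It is, of course, clear that for the definition of $\Lambda_0$, $\Theta_0$ and $h_\delta$ we only need to have
the number $\mathrm{DCS}(k, 1, \delta^2/16)$ at our disposal. The main result of this section is the
following dichotomy.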

Proposition 7.5. Let $k \in \mathbb{N}$ with $k \geq 2$ and assume that for every $0 < \beta \leq 1$ the
number $\mathrm{DCS}(k, 1, \beta)$ has been defined.

Let $0 < \delta \leq 1$ and define $\Lambda_0$ and $\Theta_0$ as in (7.17). Also let $L$ be a nonempty
finite subset of $\mathbb{N}$ and $A \subseteq [k]^{<\mathbb{N}}$ such that $\mathrm{dens}_{[k]^\ell}(A) \geq \delta$ for every $\ell \in L$. Finally,
let $n \in \mathbb{N}$ with $n \geq 1$ and assume that $|L| \geq h_\delta(n)$ where $h_\delta$ is as in (7.18). Then,
setting $L_0$ to be the set of the first $\Lambda_0$ elements of $L$, we have that either

(i) there exist a subset $L'$ of $L \setminus L_0$ with $|L'| \geq n$ and a word $t_0 \in [k]^{\ell_0}$ for
some $\ell_0 \in L_0$ such that

(7.19) $\mathrm{dens}_{[k]^{\ell - \ell_0}}(\{s \in [k]^{<\mathbb{N}} : t_0^\frown s \in A\}) \geq \delta + \delta^2/8$
for every $\ell \in L'$, or

(ii) there exist a subset $L''$ of $L \setminus L_0$ with $|L''| \geq n$ and a Carlson-Simpson line
$V$ of $[k]^{<\mathbb{N}}$ contained in $A$ with $L(V) \subseteq L_0$ and such that, setting $\ell_1$ to be
the unique integer with $V(1) \subseteq [k]^{\ell_1}$, for every $\ell \in L''$ we have

(7.20) $\mathrm{dens}_{[k]^{\ell - \ell_1}}(\{s \in [k]^{<\mathbb{N}} : t^\frown s \in A \text{ for every } t \in V(1)\}) \geq \Theta_0/2$.

Proposition 7.5 can be used to estimate the numbers $\mathrm{DCS}(k, m+1, \delta)$ via a
standard iteration. In particular, we have the following corollary.

Corollary 7.6. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$ and assume that for every
$0 < \beta \leq 1$ the number $\mathrm{DCS}(k, m, \beta)$ has been defined. Then, for every $0 < \delta \leq 1$

(7.21) $\mathrm{DCS}(k, m+1, \delta) \leq h_\delta^{(\lceil 8\delta^{-2} \rceil)}(\mathrm{DCS}(k, m, \Theta_0/2))$.

Proof. We fix $0 < \delta \leq 1$. For notational convenience we set $N_0 = \mathrm{DCS}(k, m, \Theta_0/2)$.
Let $L$ be an arbitrary finite subset of $\mathbb{N}$ with $|L| \geq h_\delta^{(\lceil 8\delta^{-2} \rceil)}(N_0)$ and $A$ be a subset
of $[k]^{<\mathbb{N}}$ such that $\mathrm{dens}_{[k]^\ell}(A) \geq \delta$ for every $\ell \in L$. By our assumptions on the size
of the set $L$ and repeated applications of Proposition 7.5, it is possible to find a
subset $L''$ of $L$ with $|L''| \geq N_0$ and a Carlson-Simpson line $V$ of $[k]^{<\mathbb{N}}$ contained
in $A$ such that, setting $\ell_1$ to be the unique integer with $V(1) \subseteq [k]^{\ell_1}$, we have that
$\ell_1 < \min(L'')$ and

(7.22) $\mathrm{dens}_{[k]^{\ell - \ell_1}}(\{s \in [k]^{<\mathbb{N}} : t^\frown s \in A \text{ for every } t \in V(1)\}) \geq \Theta_0/2$

for every $\ell \in L''$. By the choice of $N_0$ and (7.22), there exists an $m$-dimensional
Carlson-Simpson tree $U$ of $[k]^{<\mathbb{N}}$ such that

(7.23) $U \subseteq \{s \in [k]^{<\mathbb{N}} : t^\frown s \in A \text{ for every } t \in V(1)\}$.
Therefore, setting

(7.24) $S = V(0) \cup \bigcup_{t \in V(1)} \{t^\frown u : u \in U\}$,

we see that $S$ is a Carlson-Simpson tree of $[k]^{<\mathbb{N}}$ of dimension $m+1$ which is
contained in $A$. The proof is thus completed. □
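The exponent $\lceil 8\delta^{-2} \rceil$ in (7.21) reflects a simple count: a density cannot exceed 1, so an increment of at least $\delta^2/8$ can be repeated only boundedly many times. The following sketch, assuming that each occurrence of the first alternative raises the density by at least $\delta^2/8$, counts the possible increments and iterates a function of the shape (7.18). The numeric values passed for $\Lambda_0$ and $\Theta_0$ are placeholders chosen for illustration; the true values are far too large to compute with.

```python
# Illustrative sketch only; lambda0 and theta0 are placeholder inputs.
from fractions import Fraction
from math import ceil


def max_increments(delta):
    """Largest j with delta + j * delta**2 / 8 <= 1 (densities never exceed 1)."""
    delta = Fraction(delta)
    step = delta ** 2 / 8
    j = 0
    while delta + (j + 1) * step <= 1:
        j += 1
    return j


def iterate(h, times, x):
    """Return the iterate h^(times)(x)."""
    for _ in range(times):
        x = h(x)
    return x


def make_h(lambda0, theta0):
    """Return h(n) = lambda0 + ceil(2 * n / theta0), the shape of (7.18)."""
    theta0 = Fraction(theta0)
    return lambda n: lambda0 + ceil(2 * n / theta0)
```

For $\delta = 1/2$ the count of possible increments is 16, comfortably below $\lceil 8\delta^{-2} \rceil = 32$.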

For the proof of Proposition 7.5 we need to do some preparatory work. Specifically,
let $L$ be a finite subset of $\mathbb{N}$ with $|L| \geq 2$ and consider the convolution
operation $c_L : [k]^{<|L|} \times X_L \to [k]^{<\mathbb{N}}$ associated to $L$. For every $x \in X_L$ let
$R_x = \{c_L(t, x) : t \in [k]^{<|L|}\}$ and recall that, by Lemma 5.6, the set $R_x$ is a
Carlson-Simpson tree of $[k]^{<\mathbb{N}}$ of dimension $|L| - 1$. Therefore, we may consider
the Furstenberg-Weiss measure $d_{\mathrm{FW}}^{R_x}$ associated to $R_x$ defined in §2.7. On the
other hand, we may also consider the generalized Furstenberg-Weiss measure $d_L$
associated to the set $L$. The following lemma relates these classes of measures.

Lemma 7.7. Let $k \in \mathbb{N}$ with $k \geq 2$ and $L$ be a finite subset of $\mathbb{N}$ with $|L| \geq 2$.
Then for every subset $A$ of $[k]^{<\mathbb{N}}$ we have

(7.25) $\mathbb{E}_{x \in X_L}\, d_{\mathrm{FW}}^{R_x}(A) = d_L(A)$.

In particular, there exists a Carlson-Simpson tree $W$ of $[k]^{<\mathbb{N}}$ of dimension $|L| - 1$
with $L(W) = L$ and such that $d_{\mathrm{FW}}^{W}(A) \geq d_L(A)$.

Lemma 7.7 follows by Lemma 5.8. More precisely, notice that equality (7.25)
follows from (5.22) by averaging over all $i \in \{0, \dots, |L| - 1\}$.
The next fact is straightforward.

Fact 7.8. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$. Also let $0 < \eta \leq 1/2$ and
assume that the number $\mathrm{DCS}(k, m, \eta)$ has been defined. Finally, let $W$ be a Carlson-Simpson
tree of $[k]^{<\mathbb{N}}$ with $\dim(W) \geq \Lambda(k, m, \eta) - 1$ where $\Lambda(k, m, \eta)$ is as in (7.15).
Then every subset $B$ of $[k]^{<\mathbb{N}}$ with $d_{\mathrm{FW}}^{W}(B) \geq 2\eta$ contains a Carlson-Simpson tree
of dimension $m$.

The final ingredient of the proof of Proposition 7.5 is the following lemma.

Lemma 7.9. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$ and assume that for every
$0 < \beta \leq 1$ the number $\mathrm{DCS}(k, m, \beta)$ has been defined.

Let $0 < \varrho, \gamma \leq 1$ and $L$ be a finite subset of $\mathbb{N}$ with $|L| \geq \Lambda(k, m, \varrho\gamma/4)$ where
$\Lambda(k, m, \varrho\gamma/4)$ is as in (7.15). Also let $B$ be a subset of $[k]^{<\mathbb{N}}$ such that $d_L(B) \geq \varrho$.
If $\{A_t : t \in B\}$ is a family of measurable events in a probability space $(\Omega, \Sigma, \mu)$
satisfying $\mu(A_t) \geq \gamma$ for every $t \in B$, then there exists an $m$-dimensional Carlson-Simpson
tree $V$ of $[k]^{<\mathbb{N}}$ which is contained in $B$ and such that

(7.26) $\mu(\bigcap_{t \in V} A_t) \geq \Theta(k, m, \varrho\gamma/4)$

where $\Theta(k, m, \varrho\gamma/4)$ is as in (7.16).

Proof. We set $\eta = \varrho\gamma/4$ and $\Lambda = \Lambda(k, m, \eta)$. Notice that, by passing to an appropriate
subset of $L$ if necessary, we may assume that $|L| = \Lambda$. By Lemma 7.7, there
exists a Carlson-Simpson tree $W$ of $[k]^{<\mathbb{N}}$ of dimension $\Lambda - 1$ with $L(W) = L$ and
such that $d_{\mathrm{FW}}^{W}(B) \geq d_L(B) \geq \varrho$. For every $\omega \in \Omega$ let $B_\omega = \{t \in B \cap W : \omega \in A_t\}$
and set

(7.27) $Y = \{\omega \in \Omega : d_{\mathrm{FW}}^{W}(B_\omega) \geq \varrho\gamma/2\}$.

Since $d_{\mathrm{FW}}^{W}(B) \geq \varrho$ and $\mu(A_t) \geq \gamma$ for every $t \in B$, we see that $\mu(Y) \geq \varrho\gamma/2$. By
Fact 7.8, for every $\omega \in Y$ there exists an $m$-dimensional Carlson-Simpson tree $V_\omega$
of $[k]^{<\mathbb{N}}$ such that $V_\omega \subseteq B_\omega$. Hence, there exist $V \in \mathrm{Subtr}_m(W)$ and $G \in \Sigma$ with
$V_\omega = V$ for every $\omega \in G$ and such that

(7.28) $\mu(G) \geq \dfrac{\mu(Y)}{|\mathrm{Subtr}_m(W)|} \geq \Theta(k, m, \eta)$.

It is easy to see that $V$ satisfies the conclusion of the lemma. □
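The bound $\mu(Y) \geq \varrho\gamma/2$ used in the proof is an averaging argument. One way to spell it out is sketched below, writing $d$ for $d_{\mathrm{FW}}^{W}$ and assuming (the definition in §2.7 is not reproduced here) that $d$ is a probability measure carried by the finite set $W$.

```latex
% Sketch under the stated assumption that d = d_FW^W is a probability
% measure on the finite set W.
\begin{align*}
\int_\Omega d(B_\omega)\, d\mu(\omega)
  &= \sum_{t \in B \cap W} d(\{t\})\, \mu(A_t)
   \;\geq\; \gamma\, d(B) \;\geq\; \varrho\gamma, \\
\int_\Omega d(B_\omega)\, d\mu(\omega)
  &\leq \mu(Y) \cdot 1 + \bigl(1 - \mu(Y)\bigr) \cdot \frac{\varrho\gamma}{2}
   \;\leq\; \mu(Y) + \frac{\varrho\gamma}{2}.
\end{align*}
% Comparing the two lines gives \mu(Y) \geq \varrho\gamma/2.
```

The second line uses that $d(B_\omega) \leq 1$ on $Y$ and $d(B_\omega) < \varrho\gamma/2$ off $Y$.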

We are now ready to proceed to the proof of Proposition 7.5.

Proof of Proposition 7.5. Let $M = L \setminus L_0$ and set $M_\ell = \{m - \ell : m \in M\}$ for every
$\ell \in L_0$. Moreover, for every $\ell \in L_0$ and every $t \in [k]^\ell$ let

(7.29) $A_t = \{s \in [k]^{<\mathbb{N}} : t^\frown s \in A\}$.

By (7.16) and (7.17), we see that $\Theta_0 \leq \delta^2/8$. Hence, for every $\ell \in L_0$ we have

(7.30) $|M_\ell| = |M| \geq 2\Theta_0^{-1} n \geq 8\delta^{-2} n$.
Also observe that for every $\ell \in L_0$ we have

(7.31) $\mathbb{E}_{t \in [k]^\ell}\, d_{M_\ell}(A_t) = d_M(A)$.
Next recall that $\mathrm{dens}_{[k]^\ell}(A) \geq \delta$ for every $\ell \in L$. Therefore,

(7.32) $d_{L_0}(A) \geq \delta$ and $d_M(A) \geq \delta$.
We consider the following cases.

Case 1: there exist $\ell_0 \in L_0$ and $t_0 \in [k]^{\ell_0}$ such that $d_{M_{\ell_0}}(A_{t_0}) \geq \delta + \delta^2/4$. In this
case we have that

(7.33) $|\{m \in M_{\ell_0} : \mathrm{dens}_{[k]^m}(A_{t_0}) \geq \delta + \delta^2/8\}| \geq (\delta^2/8)|M_{\ell_0}| \geq n$.

We set $L' = \{m \in M : \mathrm{dens}_{[k]^{m - \ell_0}}(A_{t_0}) \geq \delta + \delta^2/8\}$. By (7.33), we see that with
this choice the first alternative of Proposition 7.5 holds true.

Case 2: for every $\ell \in L_0$ and every $t \in [k]^\ell$ we have $d_{M_\ell}(A_t) < \delta + \delta^2/4$. Combining
(7.31), (7.32) and taking into account our assumptions, in this case we see that for
every $\ell \in L_0$ we have

(7.34) $|\{t \in [k]^\ell : d_{M_\ell}(A_t) \geq \delta/2\}| \geq (1 - \delta/2) k^\ell$.
We set

(7.35) $B = \bigcup_{\ell \in L_0} \{t \in A \cap [k]^\ell : d_{M_\ell}(A_t) \geq \delta/2\}$.

By (7.32) and (7.34), we get that $d_{L_0}(B) \geq \delta/2$. Let

(7.36) $(\Omega, \mu) = \prod_{\ell \in L_0} ([k]^{<\mathbb{N}}, d_{M_\ell})$

be the product of the discrete probability spaces $([k]^{<\mathbb{N}}, d_{M_\ell})$. For every $t \in B$ we
define a measurable event $\widetilde{A}_t$ of $\Omega$ as follows. We set

(7.37) $\widetilde{A}_t = \prod_{\ell \in L_0} X_\ell^t$

where $X_\ell^t = A_t$ if $\ell = |t|$ and $X_\ell^t = [k]^{<\mathbb{N}}$ otherwise. Notice that for every $\ell \in L_0$
and every $t \in B \cap [k]^\ell$ we have

(7.38) $\mu(\widetilde{A}_t) = d_{M_\ell}(A_t) \overset{(7.35)}{\geq} \delta/2$.

Recall that $|L_0| = \Lambda_0 = \Lambda(k, 1, \delta^2/16)$. Since $d_{L_0}(B) \geq \delta/2$, by Lemma 7.9
applied for "$\varrho = \gamma = \delta/2$", there exists a Carlson-Simpson line $V$ of $[k]^{<\mathbb{N}}$ which is
contained in $B$ and such that

(7.39) $\mu(\bigcap_{t \in V} \widetilde{A}_t) \geq \Theta(k, 1, \delta^2/16) \overset{(7.17)}{=} \Theta_0$.

Notice, in particular, that the level set $L(V)$ of $V$ is contained in $L_0$. Let $\ell_1$ be
the unique integer with $V(1) \subseteq [k]^{\ell_1}$ and observe that $\ell_1 \in L_0$. By the definition
of the events $\{\widetilde{A}_t : t \in B\}$ in (7.37) and (7.39), we get that

(7.40) $d_{M_{\ell_1}}(\bigcap_{t \in V(1)} A_t) = \mu(\bigcap_{t \in V(1)} \widetilde{A}_t) \geq \Theta_0$.

Let

(7.41) $M'_{\ell_1} = \{m \in M_{\ell_1} : \mathrm{dens}_{[k]^m}(\bigcap_{t \in V(1)} A_t) \geq \Theta_0/2\}$.

By (7.40), we have

(7.42) $|M'_{\ell_1}| \geq (\Theta_0/2)|M_{\ell_1}| \overset{(7.30)}{\geq} n$.
We set

(7.43) $L'' = \{\ell_1 + m : m \in M'_{\ell_1}\}$.

It is clear that $|L''| \geq n$. Moreover, $L''$ is contained in $L \setminus L_0$ and so $\ell_1 < \min(L'')$.
Finally, notice that for every $\ell \in L''$ we have that $\ell - \ell_1 \in M'_{\ell_1}$. Therefore, by
(7.29) and (7.41), we conclude that

(7.44) $\mathrm{dens}_{[k]^{\ell - \ell_1}}(\{s \in [k]^{<\mathbb{N}} : t^\frown s \in A \text{ for every } t \in V(1)\}) \geq \Theta_0/2$.

That is, the second alternative of Proposition 7.5 is satisfied. The above cases are
exhaustive and so the proof is completed. □

7.3. Consequences. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$ and assume that for
every $0 < \beta \leq 1$ the number $\mathrm{DCS}(k, m, \beta)$ has been defined. For every $0 < \gamma \leq 1$
we set

(7.45) $\theta(k, m, \gamma) = \Theta(k, m, \gamma/4)$

where $\Theta(k, m, \gamma/4)$ is as in (7.16). Recall that for every Carlson-Simpson tree $W$ of
$[k+1]^{<\mathbb{N}}$ and every $k' \in \{2, \dots, k\}$ by $W \upharpoonright k'$ we denote the $k'$-restriction of $W$ defined
in (2.14). We have the following corollary.

Corollary 7.10. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$ and assume that for every
$0 < \beta \leq 1$ the number $\mathrm{DCS}(k, m, \beta)$ has been defined.

Let $0 < \gamma \leq 1$ and $d \in \mathbb{N}$ with $d \geq \Lambda(k, m, \gamma/4) - 1$ where $\Lambda(k, m, \gamma/4)$ is as in
(7.15). Also let $V$ be a Carlson-Simpson tree of $[k+1]^{<\mathbb{N}}$ with

(7.46) $\dim(V) \geq \mathrm{CS}(k+1, d, m, 2)$.

If $\{A_t : t \in V\}$ is a family of measurable events in a probability space $(\Omega, \Sigma, \mu)$
satisfying $\mu(A_t) \geq \gamma$ for every $t \in V$, then there exists $W \in \mathrm{Subtr}_d(V)$ such that
for every $U \in \mathrm{Subtr}_m(W)$ we have

(7.47) $\mu(\bigcap_{t \in U \upharpoonright k} A_t) \geq \theta(k, m, \gamma)$

where $\theta(k, m, \gamma)$ is as in (7.45).

Proof. We set $\theta = \theta(k, m, \gamma)$ and we define

(7.48) $\mathcal{U} = \{U \in \mathrm{Subtr}_m(V) : \mu(\bigcap_{t \in U \upharpoonright k} A_t) \geq \theta\}$.

By (7.46) and Theorem 4.1, it is possible to select $W \in \mathrm{Subtr}_d(V)$ such that either
$\mathrm{Subtr}_m(W) \subseteq \mathcal{U}$ or $\mathrm{Subtr}_m(W) \cap \mathcal{U} = \emptyset$. Therefore, it is enough to show that
$\mathrm{Subtr}_m(W) \cap \mathcal{U} \neq \emptyset$.
To this end we argue as follows. Let $\mathrm{I}_W : [k+1]^{<d+1} \to W$ be the canonical
isomorphism associated to $W$ and for every $t \in [k]^{<d+1}$ set $A'_t = A_{\mathrm{I}_W(t)}$. By Lemma
7.9, there exists an $m$-dimensional Carlson-Simpson subtree $R$ of $[k]^{<d+1}$ such that

(7.49) $\mu(\bigcap_{t \in R} A'_t) \geq \theta$.

Let $S$ be the unique element of $\mathrm{Subtr}_m(W)$ such that $S \upharpoonright k = \mathrm{I}_W(R)$. Then, by
(7.49), we conclude that $S \in \mathcal{U}$ and the proof is completed. □

The final result of this section is the following corollary. 

Corollary 7.11. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$ and assume that for every
$0 < \beta \leq 1$ the number $\mathrm{DCS}(k, m, \beta)$ has been defined.

Let $0 < \gamma \leq 1$ and $d \in \mathbb{N}$ with $d \geq \Lambda(k, m, \gamma/4) - 1$ where $\Lambda(k, m, \gamma/4)$ is as in
(7.15). Also let $V$ be a Carlson-Simpson tree of $[k+1]^{<\mathbb{N}}$ with

(7.50) $\dim(V) \geq \mathrm{CS}(k+1, d, m, 2)$.

Finally let $\{A_t : t \in V\}$ be a family of measurable events in a probability space
$(\Omega, \Sigma, \mu)$ satisfying $\mu(A_t) \geq \gamma$ for every $t \in V$. Assume, in addition, that there
exists $r \in [k]$ such that $A_t = A_{t'}$ for every $t, t' \in V$ which are $(r, k+1)$-equivalent
(see §2.6). Then there exists $W \in \mathrm{Subtr}_d(V)$ such that for every $U \in \mathrm{Subtr}_m(W)$
we have

(7.51) $\mu(\bigcap_{t \in U} A_t) \geq \theta(k, m, \gamma)$
where $\theta(k, m, \gamma)$ is as in (7.45).

Proof. Since $A_t = A_{t'}$ for every $t, t' \in V$ which are $(r, k+1)$-equivalent, for every
$\ell \in [\dim(V)]$ and every $R \in \mathrm{Subtr}_\ell(V)$ we have

(7.52) $\bigcap_{t \in R} A_t = \bigcap_{t \in R \upharpoonright k} A_t$.

Using this observation, the result follows by Corollary 7.10. □

8. A PROBABILISTIC VERSION OF THEOREM B 

8.1. Overview. In this subsection we give an outline of the proof of Theorem B. 
Very briefly, and oversimplifying dramatically, the proof proceeds by induction on 
k and is based on a density increment strategy. 

The first step is given in Corollary 7.6. Indeed, by Corollary 7.6, the proof of
Theorem B reduces to the task of estimating the numbers $\mathrm{DCS}(k, 1, \delta)$. To achieve
this goal we follow an inductive scheme that can be described as follows:

(8.1) $\mathrm{DCS}(k, m, \beta)$ for every $m$ and $\beta$ $\Rightarrow$ $\mathrm{DCS}(k+1, 1, \delta)$.

Precisely, in order to estimate the numbers $\mathrm{DCS}(k+1, 1, \delta)$ we need to have at our
disposal the numbers $\mathrm{DCS}(k, m, \beta)$ for every integer $m \geq 1$ and every $0 < \beta \leq 1$.
The base case - that is, the estimation of the numbers $\mathrm{DCS}(2, 1, \delta)$ - is, of course,
the content of Proposition 7.1.

At this point it is useful to recall the philosophy of the density increment method.
One starts with a subset $A$ of a "structured" set $\mathcal{S}$ of density $\delta$ and assumes that $A$
does not contain a subset of a certain kind. The goal is then to find a sufficiently
large "substructure" $\mathcal{S}'$ of $\mathcal{S}$ such that the density of $A$ inside $\mathcal{S}'$ is at least $\delta + \gamma$,
where $\gamma$ is a positive constant that depends only on $\delta$. Usually this task is rather
difficult to achieve at once, and so, one first tries to increase the density of $A$ inside
a relatively "simple" subset of $\mathcal{S}$. We refer to the essay [18] of W. T. Gowers for a
thorough exposition of this method.

The proof of the inductive scheme described in (8.1) follows the strategy just
mentioned above. Specifically, fix the parameters $k$ and $\delta$ and let $A$ be a subset of
$[k+1]^{<\mathbb{N}}$ not containing a Carlson-Simpson line such that $\mathrm{dens}_{[k+1]^n}(A) \geq \delta$ for
sufficiently many $n \in \mathbb{N}$. What we find is a Carlson-Simpson tree $W$ of $[k+1]^{<\mathbb{N}}$
such that the density of the set $A$ has been significantly increased in sufficiently
many levels of $W$. This is done in two steps. Firstly we show that there exists a
Carlson-Simpson tree $V$ of $[k+1]^{<\mathbb{N}}$ and a subset $D$ of $V$ which is the intersection
of relatively few insensitive sets and correlates with the set $A$ more than expected
in many levels of $V$ (it is useful to view $D$ as a "simple" subset of $V$). This is
the content of Corollary 8.6 below. In the second step we use this information to
achieve the density increment. We will not comment at this point on the second
step, since we will do so in §9.1; here we simply mention that the statement of main
interest is Corollary 9.15.

We will, however, discuss in detail the proof of the first step which is analogous
to the first part of the polymath proof of the density Hales-Jewett Theorem. In
fact, this is more than an analogy since we are using a beautiful argument from the
polymath proof (see [28, §7.2]) to reduce the proof of Corollary 8.6 - the main result
of this step - to a "probabilistic" version of Theorem B. The analogy, however, with
the polymath proof breaks down at this point and the main bulk of the argument
is quite different.

The aforementioned "probabilistic" version of Theorem B refers to the question
whether a dense subset of $[k+1]^{<\mathbb{N}}$ not only will contain a Carlson-Simpson line
but, actually, a non-trivial portion of them. Results of this type figure prominently
in Ramsey Theory and have found significant applications (see, e.g., [32] and
the references therein). A classical result in this direction is the "probabilistic"
version of Szemerédi's theorem, essentially due to P. Varnavides [39], asserting that
for every integer $k \geq 3$ and every $0 < \delta \leq 1$ there exists a constant $c(k, \delta) > 0$
such that every subset $A$ of $[n]$ with $|A| \geq \delta n$ contains at least $c(k, \delta) n^2$ arithmetic
progressions of length $k$, as long as $n$ is sufficiently large.
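To make the quantity in Varnavides' theorem concrete, the following sketch counts the arithmetic progressions of length $k$ (with positive common difference) inside a finite set of positive integers by brute force. The function name and the convention of counting each progression once, with common difference at least 1, are assumptions made for this illustration; the theorem itself is not used or verified by the code.

```python
# Illustrative brute-force sketch; the counting convention is an assumption.
def count_progressions(A, k):
    """Count k-term progressions a, a+d, ..., a+(k-1)d with d >= 1 inside A."""
    if k < 2:
        raise ValueError("k must be at least 2")
    S = set(A)
    if not S:
        return 0
    top = max(S)
    count = 0
    for a in S:
        d = 1
        # The last term a + (k-1)d must not exceed the largest element of A.
        while a + (k - 1) * d <= top:
            if all(a + i * d in S for i in range(k)):
                count += 1
            d += 1
    return count
```

For example, the full interval $\{1, \dots, 9\}$ contains 16 progressions of length 3, while the odd numbers in it contain 4.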

However, some density results do not admit a "probabilistic" version of the form
stated above. The most well-known example is the density Hales-Jewett Theorem.
Indeed, if $n$ is large enough, then one can find a highly dense subset of, say, $[2]^n$
containing just a tiny portion of combinatorial lines; see [28, §3.1]. This example
can be modified, in a straightforward way, to show that Theorem B also fails to admit
a naive "probabilistic" version.

This phenomenon appears to be quite discouraging, but it can be bypassed. So
far there has been only one method in the literature dealing with this problem.
It was introduced by the participants of the polymath project and was based on
the technique of changing the measure. Part of the novelty of the present paper
is the development of a new method which not only is conceptually easy to grasp
but also appears to be quite robust (the clearest sign for this is that it can be
combined with the arguments in [28] to give a very simple proof [11] of the density
Hales-Jewett Theorem). The idea is to avoid the pathological behavior by passing
to an appropriate "substructure". This is done by applying the following three basic
steps. We will describe them in an abstract setting since we feel that no clarity will
be gained by restricting our discussion to the specifics of Theorem B.

Step 1. By an application of Szemerédi's regularity method [37], we show that a
given dense set $A$ of our "structured" set $\mathcal{S}$ is sufficiently pseudorandom. This
enables us to model the set $A$ as a family of measurable events $\{B_t : t \in \mathcal{R}\}$ in a
probability space $(\Omega, \Sigma, \mu)$ indexed by a Ramsey space $\mathcal{R}$ closely related, of course,
with $\mathcal{S}$. The measure of the events is controlled by the density of $A$.

Step 2. We apply coloring arguments and our basic density result to show that there
exists a "substructure" $\mathcal{R}'$ of $\mathcal{R}$ such that the events in the subfamily $\{B_t : t \in \mathcal{R}'\}$
are highly correlated. The reasoning can be traced in an old paper of P. Erdős and
A. Hajnal [12]. Also we notice that it is precisely in this step that we need to pass
to a "substructure". As can be seen from the examples mentioned above, this is a
necessity rather than a coincidence.

Step 3. We use a double counting argument to locate a "substructure" $\mathcal{S}'$ of $\mathcal{S}$ such
that the set $A$ contains a non-trivial portion of subsets of $\mathcal{S}'$ of the desired kind
(combinatorial lines, Carlson-Simpson lines, etc.).

Some final comments on the computational effectiveness of the method. In all
cases of interest known to the authors, the tools used in the three steps described
above have primitive recursive (and fairly reasonable) bounds. However, the argument
yields very poor lower bounds for the correlation of the events $\{B_t : t \in \mathcal{R}'\}$ in
the second step. These lower bounds are partly responsible for the Ackermannian
behavior of the numbers $\mathrm{DCS}(k, m, \delta)$.

8.2. The main dichotomy. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$ and assume that
for every $0 < \beta \leq 1$ the number $\mathrm{DCS}(k, m, \beta)$ has been defined. Hence, for every
$0 < \delta \leq 1$ we may set

(8.2) $\vartheta = \vartheta(k, m, \delta) = \Theta(k, m, \delta/8)$ and $\eta = \eta(k, m, \delta) =$ —
where $\Theta(k, m, \delta/8)$ is as in (7.16). Moreover let

(8.3) $\Lambda' = \Lambda(k, m, \delta/8) = \lceil 8\delta^{-1} \mathrm{DCS}(k, m, \delta/8) \rceil$

and for every $n \in \mathbb{N}$ set

(8.4) $\ell(n, m) = \mathrm{CS}(k+1, n + \Lambda', m, 2) + 1$.

We define the map $G : \mathbb{N} \times \mathbb{N} \times (0, 1] \to \mathbb{N}$ by the rule

(8.5) $G(n, m, \varepsilon) = \mathrm{Reg}(k+1, \ell(n, m), 1, \varepsilon)$.



Also for every Carlson-Simpson tree $V$ of $[k]^{<\mathbb{N}}$ and every $1 \leq m \leq i \leq \dim(V)$ let

(8.6) $\mathrm{Subtr}_m^0(V, i) = \{R \in \mathrm{Subtr}_m(V) : R(0) = V(0) \text{ and } R(m) \subseteq V(i)\}$.

As we have pointed out in §2.5, if $W$ is a Carlson-Simpson tree of $[k+1]^{<\mathbb{N}}$, then
its $k$-restriction $W \upharpoonright k$ can be identified as a Carlson-Simpson tree of $[k]^{<\mathbb{N}}$. Hence,
we may also consider the set $\mathrm{Subtr}_m^0(W \upharpoonright k, i)$ whenever $W$ is a Carlson-Simpson
tree of $[k+1]^{<\mathbb{N}}$.

We are ready to state the main result of this section. 

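Proposition 8.1. Let $k, m \in \mathbb{N}$ with $k \geq 2$ and $m \geq 1$ and assume that for every
$0 < \beta \leq 1$ the number $\mathrm{DCS}(k, m, \beta)$ has been defined.

Let $0 < \delta \leq 1$ and define $\vartheta$, $\eta$ and $\Lambda'$ as in (8.2) and (8.3) respectively. Also let
$n \in \mathbb{N}$ with $n \geq 1$ and $N$ be a finite subset of $\mathbb{N}$ such that

(8.7) $|N| \geq G(\lceil \eta^{-4} n \rceil, m, \eta^2/2)$

where $G$ is as in (8.5). If $A \subseteq [k+1]^{<\mathbb{N}}$ satisfies $|A \cap [k+1]^\ell| \geq \delta (k+1)^\ell$
for every $\ell \in N$, then there exist a Carlson-Simpson tree $W$ of $[k+1]^{<\mathbb{N}}$ with
$\dim(W) = \lceil \eta^{-4} n \rceil + \Lambda'$ and $I \subseteq \{m, \dots, \dim(W)\}$ with $|I| \geq n$ such that either

(i) for every $i \in I$ we have $\mathrm{dens}_{W(i)}(A) \geq \delta + \eta^2/2$, or

(ii) for every $i \in I$ we have $\mathrm{dens}_{W(i)}(A) \geq \delta - 2\eta$ and

(8.8) $\mathrm{dens}(\{V \in \mathrm{Subtr}_m^0(W \upharpoonright k, i) : V \subseteq A\}) \geq \vartheta/2$.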

For the proof of Proposition 8.1 we need to do some preparatory work. We start
with the following lemma.

Lemma 8.2. Let $k$, $m$, $\delta$, $\eta$, $\Lambda'$, $n$ and $N$ be as in Proposition 8.1. If $A$ is a
subset of $[k+1]^{<\mathbb{N}}$ satisfying $|A \cap [k+1]^\ell| \geq \delta (k+1)^\ell$ for every $\ell \in N$, then
there exist a subset $L$ of $N$ with $|L| = \mathrm{CS}(k+1, \lceil \eta^{-4} n \rceil + \Lambda', m, 2) + 1$ and a
Carlson-Simpson subtree $W$ of $[k+1]^{<|L|}$ of dimension $\lceil \eta^{-4} n \rceil + \Lambda'$ such that,
setting $c_L : [k+1]^{<|L|} \times X_L \to [k+1]^{<\mathbb{N}}$ to be the convolution operation associated
to $L$ and $B = c_L^{-1}(A)$, the following properties hold.

(i) For every $t \in [k+1]^{<|L|}$ we have $\mathrm{dens}(B_t) \geq \delta - \eta^2/2$ where $B_t$ is the
section of $B$ at $t$.

(ii) For every $U \in \mathrm{Subtr}_m(W \upharpoonright k)$ we have

(8.9) $\mathrm{dens}(\bigcap_{t \in U} B_t) \geq \vartheta$.

Proof. By (8.7) and the definition of the function $G$ in (8.5), we may apply Lemma
EO and we get $L \subseteq N$ with $|L| = \mathrm{CS}(k+1, \lceil \eta^{-4} n \rceil + \Lambda', m, 2) + 1$ and such that
the family $\mathcal{F} := \{A\}$ is $(\eta^2/2, L)$-regular. Using this information and arguing as in
the proof of Lemma 7.2, we see that the first part of the lemma is satisfied.

For part (ii), set $V = [k+1]^{<|L|}$. Also let $\gamma = \delta/2$ and $d = \lceil \eta^{-4} n \rceil + \Lambda'$. Notice
that $d \geq \Lambda' - 1 = \Lambda(k, m, \gamma/4) - 1$. Moreover, by part (i), we have

(8.10) $\dim(V) = |L| - 1 = \mathrm{CS}(k+1, d, m, 2)$

and $\mathrm{dens}(B_t) \geq \delta - \eta^2/2 \geq \gamma$ for every $t \in [k+1]^{<|L|}$. Hence, by Corollary 7.10,
we get a $d$-dimensional Carlson-Simpson subtree $W$ of $[k+1]^{<|L|}$ such that

(8.11) $\mathrm{dens}(\bigcap_{t \in U \upharpoonright k} B_t) \geq \theta(k, m, \gamma)$

for every $U \in \mathrm{Subtr}_m(W)$. Since $\mathrm{Subtr}_m(W \upharpoonright k) = \{U \upharpoonright k : U \in \mathrm{Subtr}_m(W)\}$ and
$\theta(k, m, \gamma) = \Theta(k, m, \gamma/4) = \vartheta$ the result follows. □

To state the next result towards the proof of Proposition 8.1 we need to recall some
notation introduced in §5. Specifically, let $L$ be a nonempty finite subset of $\mathbb{N}$ and
consider the convolution operation $c_L : [k+1]^{<|L|} \times X_L \to [k+1]^{<\mathbb{N}}$ associated
to $L$. As in (5.14), for every Carlson-Simpson subtree $W$ of $[k+1]^{<|L|}$ and every
$x \in X_L$ we set

(8.12) $W_x = \{c_L(w, x) : w \in W\}$.

Recall that, by Lemma 5.6, $W_x$ is a Carlson-Simpson subtree of $[k+1]^{<\mathbb{N}}$ with
$\dim(W_x) = \dim(W)$. We have the following lemma.

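Lemma 8.3. Let $k \in \mathbb{N}$ with $k \geq 2$, $L$ be a nonempty finite subset of $\mathbb{N}$ and consider
the convolution operation $c_L : [k+1]^{<|L|} \times X_L \to [k+1]^{<\mathbb{N}}$ associated to the set $L$.
Also let $W$ be a Carlson-Simpson subtree of $[k+1]^{<|L|}$ and $A \subseteq [k+1]^{<\mathbb{N}}$. We set
$B = c_L^{-1}(A)$ and for every $t \in [k+1]^{<|L|}$ let $B_t$ be the section of $B$ at $t$. Moreover,
for every $x \in X_L$ let $W_x$ be as in (8.12). Then the following hold.

(i) For every $i \in \{0, \dots, \dim(W)\}$ we have

(8.13) $\mathbb{E}_{x \in X_L}\, \mathrm{dens}_{W_x(i)}(A) = \mathbb{E}_{t \in W(i)}\, \mathrm{dens}(B_t)$.

(ii) For every $1 \leq m \leq i \leq \dim(W)$ we have

$\mathbb{E}_{x \in X_L}\, \mathrm{dens}(\{V \in \mathrm{Subtr}_m^0(W_x \upharpoonright k, i) : V \subseteq A\}) = \mathbb{E}_{U \in \mathrm{Subtr}_m^0(W \upharpoonright k, i)}\, \mathrm{dens}(\bigcap_{t \in U} B_t)$.

Proof. (i) Fix $i \in \{0, \dots, \dim(W)\}$. By Lemma 5.8, we have

(8.14) $\mathbb{E}_{x \in X_L}\, \mathrm{dens}_{W_x(i)}(A) = \mathrm{dens}_{c_L(W(i) \times X_L)}(A)$.
Also notice that

(8.15) $c_L(W(i) \times X_L) = \bigcup_{t \in W(i)} c_L(\{t\} \times X_L) = \bigcup_{t \in W(i)} \Omega_t$.

Next observe that $W(i) \subseteq [k+1]^\ell$ for some $\ell \in \{0, \dots, |L| - 1\}$. Therefore, by Fact
5.2, we see that $|\Omega_t| = |\Omega_{t'}|$ for every $t, t' \in W(i)$. Hence,

(8.16) $\mathrm{dens}_{c_L(W(i) \times X_L)}(A) = \mathbb{E}_{t \in W(i)}\, \mathrm{dens}_{\Omega_t}(A)$.
Finally, by Lemma 5.5, for every $t \in W(i)$ we have

(8.17) $\mathrm{dens}_{\Omega_t}(A) = \mathrm{dens}_{\{t\} \times X_L}(B) = \mathrm{dens}(B_t)$.
Combining (8.14), (8.16) and (8.17) we conclude that (8.13) is satisfied.
(ii) We fix $1 \leq m \leq i \leq \dim(W)$ and we set

(8.18) $\mathcal{A} = \{(U, x) \in \mathrm{Subtr}_m^0(W \upharpoonright k, i) \times X_L : U_x \subseteq A\}$

where, as in (8.12), $U_x = \{c_L(u, x) : u \in U\}$. Also for every $U \in \mathrm{Subtr}_m^0(W \upharpoonright k, i)$
and every $x \in X_L$ let

(8.19) $\mathcal{A}_U = \{x \in X_L : (U, x) \in \mathcal{A}\}$
and

(8.20) $\mathcal{A}_x = \{U \in \mathrm{Subtr}_m^0(W \upharpoonright k, i) : (U, x) \in \mathcal{A}\}$
be the sections of $\mathcal{A}$ at $U$ and $x$ respectively. Notice that

(8.21) $x \in \mathcal{A}_U \Leftrightarrow (U, x) \in \mathcal{A} \Leftrightarrow U_x \subseteq A \Leftrightarrow x \in \bigcap_{t \in U} B_t$.

This, of course, implies that for every $U \in \mathrm{Subtr}_m^0(W \upharpoonright k, i)$ we have

(8.22) $\mathcal{A}_U = \bigcap_{t \in U} B_t$.

Moreover, it is easy to see that for every $x \in X_L$ the map

(8.23) $\mathrm{Subtr}_m^0(W \upharpoonright k, i) \ni U \mapsto U_x \in \mathrm{Subtr}_m^0(W_x \upharpoonright k, i)$

is a bijection. Therefore, for every $x \in X_L$ we have

$\mathrm{dens}(\{V \in \mathrm{Subtr}_m^0(W_x \upharpoonright k, i) : V \subseteq A\}) = \dfrac{|\{V \in \mathrm{Subtr}_m^0(W_x \upharpoonright k, i) : V \subseteq A\}|}{|\mathrm{Subtr}_m^0(W_x \upharpoonright k, i)|}$
$= \dfrac{|\{U \in \mathrm{Subtr}_m^0(W \upharpoonright k, i) : U_x \subseteq A\}|}{|\mathrm{Subtr}_m^0(W \upharpoonright k, i)|}$
$= \mathrm{dens}(\mathcal{A}_x)$.

Taking into account (8.22) and the above equalities we conclude that

$\mathbb{E}_{x \in X_L}\, \mathrm{dens}(\{V \in \mathrm{Subtr}_m^0(W_x \upharpoonright k, i) : V \subseteq A\}) = \mathbb{E}_{x \in X_L}\, \mathrm{dens}(\mathcal{A}_x)$
$= \mathbb{E}_{U \in \mathrm{Subtr}_m^0(W \upharpoonright k, i)}\, \mathrm{dens}(\mathcal{A}_U)$
$= \mathbb{E}_{U \in \mathrm{Subtr}_m^0(W \upharpoonright k, i)}\, \mathrm{dens}(\bigcap_{t \in U} B_t)$

and the proof is completed. □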

We are ready to proceed to the proof of Proposition 8.1.

Proof of Proposition 8.1. We fix $A \subseteq [k+1]^{<\mathbb{N}}$ such that $|A \cap [k+1]^\ell| \geq \delta (k+1)^\ell$
for every $\ell \in N$. For notational convenience, we set $d = \lceil \eta^{-4} n \rceil + \Lambda'$. Let $L$ and
$W$ be as in Lemma 8.2 when applied to the fixed set $A$. Notice that $\dim(W) = d$.
Invoking the first parts of Lemmas 8.2 and 8.3, for every $i \in \{0, \dots, d\}$ we have

(8.24) $\mathbb{E}_{x \in X_L}\, \mathrm{dens}_{W_x(i)}(A) \geq \delta - \eta^2/2$.

On the other hand, by the second parts of the aforementioned lemmas, we see that

(8.25) $\mathbb{E}_{x \in X_L}\, \mathrm{dens}(\{V \in \mathrm{Subtr}_m^0(W_x \upharpoonright k, i) : V \subseteq A\}) \geq \vartheta$

for every $i \in \{m, \dots, d\}$.

Let $J = \{m, \dots, d\}$ and notice that

(8.26) $|J| = d - m + 1 \geq \lceil \eta^{-4} n \rceil$.
Also for every $i \in J$ set

(8.27) $X_i^0 = \{x \in X_L : \mathrm{dens}_{W_x(i)}(A) \geq \delta + \eta^2/2\}$,

(8.28) $X_i^1 = \{x \in X_L : \mathrm{dens}_{W_x(i)}(A) \geq \delta - 2\eta\}$
and

(8.29) $X_i^2 = \{x \in X_L : \mathrm{dens}(\{V \in \mathrm{Subtr}_m^0(W_x \upharpoonright k, i) : V \subseteq A\}) \geq \vartheta/2\}$.
Finally let $J_0 = \{i \in J : \mathrm{dens}(X_i^0) \geq \eta^3\}$. We distinguish the following cases.
Case 1: we have $|J_0| \geq |J|/2$. By Lemma 2.5, there exists $x_0 \in X_L$ such that

(8.30) $|\{i \in J_0 : x_0 \in X_i^0\}| \geq n$.

We set "$I = \{i \in J : x_0 \in X_i^0\}$" and "$W = W_{x_0}$". With these choices it is easy to
see that the first part of the proposition is satisfied.

Case 2: we have $|J_0| < |J|/2$. In this case we set $J_1 = J \setminus J_0$. Let $i \in J_1$ be
arbitrary and notice that

(8.31) $\mathrm{dens}(X_i^0) = \mathrm{dens}(\{x \in X_L : \mathrm{dens}_{W_x(i)}(A) \geq \delta + \eta^2/2\}) < \eta^3$.

Combining (8.24) and (8.31) we see that $\mathrm{dens}(X_i^1) \geq 1 - \eta$. On the other hand, by
(8.25), we have $\mathrm{dens}(X_i^2) \geq \vartheta/2$. Therefore, by the choice of $\eta$ in (8.2), we conclude
that $\mathrm{dens}(X_i^1 \cap X_i^2) \geq \vartheta/4$ for every $i \in J_1$. By a second application of Lemma 2.5,
we find $x_1 \in X_L$ such that

(8.32) $|\{i \in J_1 : x_1 \in X_i^1 \cap X_i^2\}| \geq n$.

We set "$I = \{i \in J_1 : x_1 \in X_i^1 \cap X_i^2\}$" and "$W = W_{x_1}$" and we notice that with
these choices the second part of the proposition is satisfied. The above cases are
exhaustive and the proof is completed. □

8.3. Obtaining insensitive sets. Let $k \in \mathbb{N}$ with $k \geq 2$ and assume that for every
$0 < \beta \leq 1$ the number $\mathrm{DCS}(k, 1, \beta)$ has been defined. For every $0 < \delta \leq 1$ let

(8.33) $\vartheta_1 = \vartheta_1(k, \delta) = \vartheta(k, 1, \delta)$ and $\eta_1 = \eta_1(k, \delta) = \eta(k, 1, \delta)$

where $\vartheta(k, 1, \delta)$ and $\eta(k, 1, \delta)$ are as in (8.2). Also define $G_1 : \mathbb{N} \times (0, 1] \to \mathbb{N}$ by

(8.34) $G_1(n, \varepsilon) = G(n, 1, \varepsilon)$
where $G$ is as in (8.5). We have the following lemma.

Lemma 8.4. Let $k \in \mathbb{N}$ with $k \geq 2$ and assume that for every $0 < \beta \leq 1$ the
number $\mathrm{DCS}(k, 1, \beta)$ has been defined.

Let $0 < \delta \leq 1$ and define $\vartheta_1$ and $\eta_1$ as in (8.33). Also let $n \in \mathbb{N}$ with $n \geq 1$ and
$N$ be a finite subset of $\mathbb{N}$ such that

(8.35) $|N| \geq G_1(\lceil \eta_1^{-4} (k+1) n \rceil, \eta_1^2/2)$

where $G_1$ is as in (8.34). Finally let $A \subseteq [k+1]^{<\mathbb{N}}$ such that $|A \cap [k+1]^\ell| \geq \delta (k+1)^\ell$
for every $\ell \in N$ and assume that $A$ contains no Carlson-Simpson line of $[k+1]^{<\mathbb{N}}$.

Assume, moreover, that for every Carlson-Simpson tree $W$ of $[k+1]^{<\mathbb{N}}$ we have

(8.36) $|\{i \in \{0, \dots, \dim(W)\} : \mathrm{dens}_{W(i)}(A) \geq \delta + \eta_1^2/2\}| < n$.

Then there exist a Carlson-Simpson subtree $V$ of $[k+1]^{<\mathbb{N}}$, a subset $C$ of $V$ and a
subset $J$ of $\{0, \dots, \dim(V)\}$ with the following properties.

(i) We have $|J| \geq n$.

(ii) We have $C = \bigcap_{r=1}^{k} C_r$ where the set $C_r$ is $(r, k+1)$-insensitive in $V$ for
every $r \in [k]$. Moreover, $\mathrm{dens}_{V(j)}(C) \geq \vartheta_1/2$ for every $j \in J$.

(iii) The sets $A$ and $C$ are disjoint.

(iv) For every $j \in J$ we have $\mathrm{dens}_{V(j)}(A) \geq \delta - 5k\eta_1$.
Lemma 8.4 will be reduced to Proposition 8.1. The reduction will be achieved
using essentially the same arguments as in [28, §7.2].

First we need to introduce some pieces of notation. Specifically, let $W$ be a
Carlson-Simpson tree of $[k+1]^{<\mathbb{N}}$ and set $d = \dim(W)$. Consider the canonical
isomorphism $\mathrm{I}_W : [k+1]^{<d+1} \to W$ associated to $W$ (see §2.5) and for every
$r \in [k+1]$ let

(8.37) $W[r] = \{\mathrm{I}_W(s) : s \in [k+1]^{<d+1} \text{ with } |s| \geq 1 \text{ and } s(0) = r\}$.
Notice that if $\dim(W) \geq 2$, then $W[r]$ is a Carlson-Simpson subtree of $W$ with
$\dim(W[r]) = \dim(W) - 1$. On the other hand, if $W$ is a Carlson-Simpson line,
then $W[r]$ is the singleton $\{\mathrm{I}_W(r)\}$; we will identify in this case $W[r]$ with $\mathrm{I}_W(r)$.
Next observe that for every Carlson-Simpson subtree $U$ of the $k$-restriction $W \upharpoonright k$
of $W$ there exists a unique Carlson-Simpson tree $R$ of $[k+1]^{<\mathbb{N}}$ such that $R \upharpoonright k = U$.
We will call this unique Carlson-Simpson tree $R$ the extension of $U$ and we will
denote it by $\overline{U}$. Notice that the extension of $U$ is a Carlson-Simpson subtree of $W$.
Also we will need the following elementary fact.

Fact 8.5. Let $W$ be a Carlson-Simpson tree of $[k+1]^{<\mathbb{N}}$ with $\dim(W) \geq 2$ and
set $V = W[k+1]$. If $j \in \{0, \dots, \dim(V)\}$ and $U \in \mathrm{Subtr}_1^0(W \upharpoonright k, j+1)$, then
$\overline{U}[k+1] \in V(j)$. Moreover, the map

(8.38) $\mathrm{Subtr}_1^0(W \upharpoonright k, j+1) \ni U \mapsto \overline{U}[k+1] \in V(j)$
is a bijection.

We are ready to give the proof of Lemma 8.4.

Proof of Lemma 8.4. By our assumptions we may apply Proposition 8.1 for $m = 1$
and we get a Carlson-Simpson subtree $W$ of $[k+1]^{<\mathbb{N}}$ and $I \subseteq \{1, \dots, \dim(W)\}$
with $|I| \geq (k+1)n$ and such that $\mathrm{dens}_{W(i)}(A) \geq \delta - 2\eta_1$ and

(8.39) $\mathrm{dens}(\{U \in \mathrm{Subtr}_1^0(W \upharpoonright k, i) : U \subseteq A\}) \geq \vartheta_1/2$
for every $i \in I$. For every $r \in [k+1]$ let $V_r = W[r]$. We set

(8.40) $V = V_{k+1}$,

(8.41) $C = \bigcup_{i \in I} \{\overline{U}[k+1] : U \in \mathrm{Subtr}_1^0(W \upharpoonright k, i) \text{ with } U \subseteq A\}$

and

(8.42) $J = \{j \in \{0, \dots, \dim(V)\} : \mathrm{dens}_{V(j)}(A) \geq \delta - 5k\eta_1 \text{ and } j+1 \in I\}$.

We claim that $V$, $C$ and $J$ are as desired. First we argue to show that $|J| \geq n$. Let
$J_0 = \{j \in \{0, \dots, \dim(V)\} : j+1 \in I\}$. Observe that $J \subseteq J_0$ and

(8.43) $|J_0| \geq (k+1)n$.

Let $j \in J_0 \setminus J$ be arbitrary and notice that $\mathrm{dens}_{V(j)}(A) < \delta - 5k\eta_1$. On the other
hand, $j+1 \in I$ and so

(8.44) $\mathbb{E}_{r \in [k+1]}\, \mathrm{dens}_{V_r(j)}(A) = \mathrm{dens}_{W(j+1)}(A) \geq \delta - 2\eta_1$.

Thus, there exists $r_j \in [k]$ such that $\mathrm{dens}_{V_{r_j}(j)}(A) \geq \delta + \eta_1$. Therefore, by the
classical pigeonhole principle, there exists $r_0 \in [k]$ such that

(8.45) $|\{j \in J_0 \setminus J : \mathrm{dens}_{V_{r_0}(j)}(A) \geq \delta + \eta_1\}| \geq \dfrac{|J_0 \setminus J|}{k} \overset{(8.43)}{\geq} \dfrac{(k+1)n - |J|}{k}$.
Moreover, by (8.36), we have

(8.46) $|\{j \in J_0 \setminus J : \mathrm{dens}_{V_{r_0}(j)}(A) \geq \delta + \eta_1\}| < n$.

Combining (8.45) and (8.46) we conclude that $|J| \geq n$.
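The existence of $r_j$ rests on a short computation, sketched below. It assumes that $\mathbb{E}_{r \in [k+1]}$ denotes the uniform average over $r \in [k+1]$, uses $V = V_{k+1}$, and reads (8.45) as the lower bound $((k+1)n - |J|)/k$.

```latex
% Fix j in J_0 \setminus J, so dens_{V_{k+1}(j)}(A) < \delta - 5k\eta_1, while by (8.44)
% the uniform average over r in [k+1] is at least \delta - 2\eta_1.
\begin{align*}
\sum_{r \in [k]} \mathrm{dens}_{V_r(j)}(A)
  &> (k+1)(\delta - 2\eta_1) - (\delta - 5k\eta_1) \\
  &= k\delta + (3k - 2)\eta_1 \;\geq\; k(\delta + \eta_1),
\end{align*}
% since 3k - 2 \geq k for k \geq 1; hence some r_j in [k] has density at least \delta + \eta_1.
% Likewise, (8.45) and (8.46) give ((k+1)n - |J|)/k < n, that is, |J| > n.
```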

We continue with the proof of part (ii). Let $r \in [k]$ be arbitrary. For every
$\ell \in \mathbb{N}$ and every $s \in [k+1]^{\ell}$ let $s^{k+1 \to r}$ be the
unique element of $[k]^{\ell}$ obtained by replacing all appearances of $k+1$ in
$s$ by $r$. We set

(8.47) $C_r = \{\mathrm{I}_V(s) : s \in \bigcup_{i \in I} [k+1]^{i-1} \text{ and } s^{k+1 \to r} \in \mathrm{I}_{V_r}^{-1}(A)\}$

where $\mathrm{I}_V$ and $\mathrm{I}_{V_r}$ are the canonical isomorphisms
associated to $V$ and $V_r$ respectively. It is easy to check that $C_r$ is
$(r, k+1)$-insensitive in $V$. Next we argue to show that $C$ coincides with
$C_1 \cap ... \cap C_k$. First we notice that $C \subseteq C_1 \cap ... \cap C_k$.
To see the other inclusion, let $t \in C_1 \cap ... \cap C_k$ be arbitrary and
set $s = \mathrm{I}_V^{-1}(t)$. Let $i$ be the unique element of $I$ such that
$s \in [k+1]^{i-1}$ and set

(8.48) $U_t = \{W(0)\} \cup \{\mathrm{I}_{V_r}(s^{k+1 \to r}) : r \in [k]\}$.

Notice that $U_t \in \mathrm{Subtr}_1^0(W \restriction k, i)$. Next observe
that, by (8.39), we have $W(0) \in A$. On the other hand, we have
$\mathrm{I}_{V_r}(s^{k+1 \to r}) \in A$ for every $r \in [k]$ since
$t \in C_1 \cap ... \cap C_k$. This shows that $U_t \subseteq A$. Observing that
$t = \overline{U_t}[k+1]$ we conclude that $t \in C$.
Finally let $j \in J$. Notice that

(8.49) $\mathrm{dens}_{V(j)}(C) = \mathrm{dens}_{V(j)}(\{\overline{U}[k+1] : U \in \mathrm{Subtr}_1^0(W \restriction k, j+1) \text{ with } U \subseteq A\})$

and recall that $j+1 \in I$. Hence, by Fact 8.5 and (8.39), we have

(8.50) $\mathrm{dens}_{V(j)}(C) = \mathrm{dens}(\{U \in \mathrm{Subtr}_1^0(W \restriction k, j+1) : U \subseteq A\}) \geq \vartheta_1/2$

as desired.

The fact that $A$ and $C$ are disjoint follows by our assumption that $A$
contains no Carlson-Simpson line of $[k+1]^{<\mathbb{N}}$ and the definition of
the set $C$. Finally, part (iv) is an immediate consequence of (8.42). The
proof is completed. □
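The substitution $s \mapsto s^{k+1 \to r}$ used in (8.47) can be made concrete with a short sketch. The function names below are hypothetical, words are modelled as tuples over $\{1, ..., k+1\}$, and the notion of $(r, k+1)$-equivalence coded here is an assumption (two words of equal length that differ only by exchanging the letters $r$ and $k+1$ in some coordinates), since the formal definition lives in §2.6 and is not reproduced in this section.

```python
def collapse(s, k, r):
    """Return s^{k+1 -> r}: every occurrence of the letter k+1 in s becomes r."""
    return tuple(r if letter == k + 1 else letter for letter in s)


def equivalent(s, t, k, r):
    """Assumed meaning of (r, k+1)-equivalence: same length, and in every
    coordinate the two letters are equal or are exactly r and k+1."""
    return len(s) == len(t) and all(
        a == b or {a, b} == {r, k + 1} for a, b in zip(s, t)
    )
```

Under this assumed definition, two words have the same image under `collapse` exactly when they are equivalent, which is the reason a set defined through $s^{k+1 \to r}$, such as $C_r$, cannot distinguish equivalent words.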

8.4. Consequences. In this subsection we summarize what we have achieved in
Proposition 8.1 and Lemma 8.4. We remark that the resulting statement is the
first main step towards the proof of Theorem B.

Corollary 8.6. Let $k \in \mathbb{N}$ with $k \geq 2$ and assume that for every
$0 < \beta \leq 1$ the number $\mathrm{DCS}(k, 1, \beta)$ has been defined.

Let $0 < \delta \leq 1$ and define $\eta_1$ as in (8.33). Also let
$n \in \mathbb{N}$ with $n \geq 1$ and $N$ be a finite subset of $\mathbb{N}$
such that

(8.51) $|N| \geq G_1(\lceil \eta_1^{-4}(k+1)kn \rceil, \eta_1^2/2)$

where $G_1$ is as in (5.34). Finally let $A \subseteq [k+1]^{<\mathbb{N}}$ such
that $|A \cap [k+1]^{\ell}| \geq \delta (k+1)^{\ell}$ for every $\ell \in N$ and
assume that $A$ contains no Carlson-Simpson line of $[k+1]^{<\mathbb{N}}$.
Then there exist a Carlson-Simpson subtree $V$ of $[k+1]^{<\mathbb{N}}$, a
subset $D$ of $V$ and a subset $I$ of $\{0, ..., \dim(V)\}$ with the following
properties.

(i) We have $|I| \geq n$.

(ii) We have $D = \bigcap_{r=1}^{k} D_r$ where the set $D_r$ is
$(r, k+1)$-insensitive in $V$ for every $r \in [k]$.

(iii) For every $i \in I$ we have
$\mathrm{dens}_{V(i)}(A \cap D) \geq (\delta + \eta_1^2/2)\,\mathrm{dens}_{V(i)}(D)$
and $\mathrm{dens}_{V(i)}(D) \geq \eta_1^2/2$.

Proof. First assume that there exists a Carlson-Simpson subtree $W$ of
$[k+1]^{<\mathbb{N}}$ such that

(8.52) $|\{i \in \{0, ..., \dim(W)\} : \mathrm{dens}_{W(i)}(A) \geq \delta + \eta_1^2/2\}| \geq kn$.

In this case we set "$V = W$",
"$I = \{i \in \{0, ..., \dim(W)\} : \mathrm{dens}_{W(i)}(A) \geq \delta + \eta_1^2/2\}$"
and "$D_r = V$" for every $r \in [k]$. It is clear that with these choices the
result follows.

Otherwise, by Lemma 8.4, there exist a Carlson-Simpson subtree $V$ of
$[k+1]^{<\mathbb{N}}$, a subset $J$ of $\{0, ..., \dim(V)\}$ with $|J| \geq kn$
and a set $C = C_1 \cap ... \cap C_k$, where $C_r$ is $(r, k+1)$-insensitive in
$V$ for every $r \in [k]$, such that

(a) $A \cap C = \emptyset$,

(b) $\mathrm{dens}_{V(j)}(C) \geq \vartheta_1/2$ for every $j \in J$ and

(c) $\mathrm{dens}_{V(j)}(A) \geq \delta - 5k\eta_1$ for every $j \in J$

where $\vartheta_1$ and $\eta_1$ are as in (8.33). In particular, for every
$j \in J$ we have

^ 5^ * * < J " 5 ^»" + **> * 5 + 

We set $Q_1 = V \setminus C_1$ and
$Q_r = (V \setminus C_r) \cap C_1 \cap ... \cap C_{r-1}$ if
$r \in \{2, ..., k\}$. Clearly the family $(Q_r)_{r=1}^{k}$ forms a partition of
$V \setminus C$. Let $j \in J$ be arbitrary. Applying Lemma 2.3 for
"$\varepsilon = k\eta_1$" we see that there exists $r_j \in [k]$ such that

(8.54) $\mathrm{dens}_{V(j)}(A \cap Q_{r_j}) \geq (\delta + 6k\eta_1)\,\mathrm{dens}_{V(j)}(Q_{r_j})$

and

(8.55) $\mathrm{dens}_{V(j)}(Q_{r_j}) \geq (\delta - 5k\eta_1)\eta_1/4$.

Hence, there exist $r_0 \in [k]$ and a subset $I$ of $J$ with
$|I| \geq |J|/k \geq n$ such that $r_i = r_0$ for every $i \in I$. We set
"$D = Q_{r_0}$". Also let "$D_r = C_r$" if $r < r_0$,
"$D_{r_0} = V \setminus C_{r_0}$" and "$D_r = V$" if $r > r_0$. Clearly $D_r$ is
$(r, k+1)$-insensitive in $V$ for every $r \in [k]$ and
$D_1 \cap ... \cap D_k = D$. Moreover, by (8.54) and the choice of $\eta_1$, for
every $i \in I$ we have

(8.56) $\mathrm{dens}_{V(i)}(A \cap D) \geq (\delta + 6k\eta_1)\,\mathrm{dens}_{V(i)}(D) \geq (\delta + \eta_1^2/2)\,\mathrm{dens}_{V(i)}(D)$

and, by (8.55),

(8.57) $\mathrm{dens}_{V(i)}(D) \geq (\delta - 5k\eta_1)\eta_1/4 \geq \eta_1^2/2$.

The proof is thus completed. □
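The partition $(Q_r)_{r=1}^{k}$ of $V \setminus C$ used in the proof above is a standard "first failure" decomposition: a point of $V \setminus C$ is placed in $Q_r$ where $r$ is the least index with the point outside $C_r$. A minimal sketch with ordinary finite sets follows; the function name and the sample data are illustrative assumptions, not objects from the paper.

```python
def first_failure_partition(V, C_sets):
    """Return [Q_1, ..., Q_k] where Q_1 = V minus C_1 and
    Q_r = (V minus C_r) intersected with C_1, ..., C_{r-1} for r >= 2."""
    parts = []
    prefix = set(V)              # V intersected with C_1, ..., C_{r-1}
    for C_r in C_sets:
        parts.append(prefix - C_r)
        prefix &= C_r
    return parts
```

Some of the parts may be empty; what matters for the argument is only that the parts are pairwise disjoint and that their union is $V \setminus (C_1 \cap ... \cap C_k)$.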






9. An exhaustion procedure: achieving the density increment

9.1. Motivation. As we have seen in Corollary 8.6, if a dense subset $A$ of
$[k+1]^{<\mathbb{N}}$ fails to contain a Carlson-Simpson line, then there exist
a Carlson-Simpson tree $V$ of $[k+1]^{<\mathbb{N}}$ and a structured subset $D$
of $V$ (recall that $D$ is the intersection of relatively few insensitive sets)
that correlates with the set $A$ more than expected in many levels of $V$. Our
goal in this section is to use this information to achieve a density increment
for the set $A$. A natural strategy for doing so, initiated by M. Ajtai and
E. Szemerédi in [1], is to produce an "almost tiling" of the set $D$, that is,
to construct a collection $\mathcal{V}$ of pairwise disjoint Carlson-Simpson
trees of sufficiently large dimension which are all contained in $D$ and are
such that the set $D \setminus \bigcup\mathcal{V}$ is essentially negligible.
Once this is done, one then expects to be able to find a Carlson-Simpson tree
$W$ belonging to the "almost tiling" $\mathcal{V}$ such that the density of the
set $A$ has been significantly increased in sufficiently many levels of $W$.
However, as is shown below, this is not possible in general.

Example 9.1. Let $m \in \mathbb{N}$ with $m \geq 1$ and $0 < \varepsilon \leq 1$
be arbitrary. Also let $q, \ell \in \mathbb{N}$ with $q \geq \ell \geq 1$ and
such that $2^{\ell}(2^{\ell} - 1)2^{-q} \leq \varepsilon$. With these choices it
is possible to select a family $\{x_t \in [2]^{q-\ell} : t \in [2]^{<\ell}\}$
such that for every $t, t' \in [2]^{<\ell}$ with $t \neq t'$ we have that
$x_t \neq x_{t'}$. For every $i \in \{0, ..., \ell - 1\}$ and every
$t \in [2]^{i}$ we set $z_t^1 = t^\frown (1^{\ell - i})^\frown x_t$ and
$z_t^2 = t^\frown (2^{\ell - i})^\frown x_t$ and we define

(9.1) $T = \{z_t^j : t \in [2]^{<\ell} \text{ and } j \in [2]\}$.

Notice that $T \subseteq [2]^{q}$ and $\mathrm{dens}(T) \leq \varepsilon$. Also
observe that $\{z_t^1, z_t^2\} \cap \{z_{t'}^1, z_{t'}^2\} = \emptyset$ provided
that $t \neq t'$. We set
$D = [2]^{<\ell} \cup [2]^{q} \cup ... \cup [2]^{q+m-1}$ and

(9.2) $A = [2]^{<\ell} \cup \{y^\frown s : y \in [2]^{q} \setminus T \text{ and } s \in [2]^{<m}\}$.

It is clear that $D$ is a highly structured subset of $[2]^{<\mathbb{N}}$ (it
is the union of certain levels of $[2]^{<\mathbb{N}}$) and $A$ is a subset of
$D$ of relative density at least $1 - \varepsilon$. Next for every
$t \in [2]^{<\ell}$ let

(9.3) $V_t = \{t\} \cup \{z_t^j{}^\frown s : j \in [2] \text{ and } s \in [2]^{<m}\}$.

Observe that $\mathcal{V} = \{V_t : t \in [2]^{<\ell}\}$ is a family of pairwise
disjoint $m$-dimensional Carlson-Simpson trees which are all contained in $D$.
Also notice that, no matter how large $\ell$ is, $\mathcal{V}$ is maximal, that
is, the set $D \setminus \bigcup\mathcal{V}$ contains no Carlson-Simpson tree of
dimension $m$. However, $V_t \cap A$ is the singleton $\{t\}$ for every
$t \in [2]^{<\ell}$.
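For small parameters the sets of Example 9.1 can be enumerated directly. The sketch below is only an illustration under stated assumptions: words are tuples over $\{1, 2\}$, the injective choice $t \mapsto x_t$ is made by pairing the two enumerations in order (any injective choice would do), and the name `build_example` is hypothetical. It checks the combinatorial claims about $T$, $D$, $A$ and the family $(V_t)$, but it does not verify that each $V_t$ is a Carlson-Simpson tree or that the family is maximal.

```python
from itertools import product


def words(n):
    """All words of length n over the alphabet {1, 2}, as tuples."""
    return [tuple(w) for w in product((1, 2), repeat=n)]


def words_below(n):
    """All words of length strictly less than n, i.e. the set [2]^{<n}."""
    return [w for length in range(n) for w in words(length)]


def build_example(m, ell, q):
    """Return (T, A, D, V) as in (9.1)-(9.3), with V a dict t -> V_t."""
    short = words_below(ell)                      # [2]^{<ell}
    pool = words(q - ell)                         # candidates for the x_t
    assert len(pool) >= len(short), "q - ell is too small for distinct x_t"
    x = dict(zip(short, pool))                    # one injective choice of x_t
    z = {(t, j): t + (j,) * (ell - len(t)) + x[t]
         for t in short for j in (1, 2)}          # the words z_t^j of length q
    T = set(z.values())
    tails = words_below(m)                        # [2]^{<m}
    A = set(short) | {y + s for y in words(q) if y not in T for s in tails}
    D = set(short) | {w for n in range(q, q + m) for w in words(n)}
    V = {t: {t} | {z[(t, j)] + s for j in (1, 2) for s in tails}
         for t in short}
    return T, A, D, V
```

For instance, `build_example(2, 2, 4)` produces three sets $V_t$ of seven words each, all inside $D$, each meeting $A$ only in its root $t$.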

The above example shows that, in our context, the problem of achieving the
density increment cannot be solved by merely producing an arbitrary "almost
tiling" of the structured set $D$. To overcome this obstacle we devise a
refined exhaustion procedure that can be roughly described as follows. At each
step of the process we are given a subset $D'$ of $D$ and we produce a
collection of Carlson-Simpson trees of sufficiently large dimension which are
all contained in $D'$. These Carlson-Simpson trees are not pairwise disjoint
since we are not aiming at producing a tiling. Instead, what we are really
interested in is whether a sufficient portion of them behaves "as expected". If
this is the case, then we can easily achieve the density increment. Otherwise,
using coloring arguments, we can show that for "almost every" Carlson-Simpson
tree $V$ of the collection, the restriction of our set $A$ to $V$ is quite
"thin" and in a very canonical way. We then remove from $D'$ an appropriately
chosen subset of the union of the collection and we repeat the argument for the
resulting set. We show that this process eventually terminates, which completes
the proof of this step.

At a technical level, in order to execute the steps described above we need to
represent any subset of $[k+1]^{<\mathbb{N}}$ as a family of measurable events
indexed by an appropriately chosen Carlson-Simpson tree of
$[k+1]^{<\mathbb{N}}$. The philosophy is identical to that in §8.2. However, due
to the recursive nature of the process, we need to work with iterated
convolutions. In particular, the reader is advised to review the material in §6
before studying this section.

9.2. The main result. Let $k \in \mathbb{N}$ with $k \geq 2$ and assume that for
every integer $\ell \geq 1$ and every $0 < \beta \leq 1$ the number
$\mathrm{DCS}(k, \ell, \beta)$ has been defined. This assumption permits us to
introduce some numerical invariants. Specifically, for every integer $m \geq 1$
and every $0 < \gamma \leq 1$ we set

(9.4) $\bar{m} = \bar{m}(m, \gamma) = \lceil 512\, m/\gamma^3 \rceil$

and

(9.5) $M = \Lambda(k, \bar{m}, \gamma^2/32) = \lceil (32/\gamma^2)\, \mathrm{DCS}(k, \bar{m}, \gamma^2/32) \rceil$.

Also let

(9.6) $\alpha = \alpha(k, m, \gamma) = \theta(k, \bar{m}, \gamma^2/8)$ and $p_0 = p_0(k, m, \gamma) = \lfloor \alpha^{-1} \rfloor$

where $\theta(k, \bar{m}, \gamma^2/8)$ is as in (7.45). Finally we define,
recursively, three sequences $(n_p)$, $(n'_p)$ and $(N_p)$ in $\mathbb{N}$,
also depending on the parameters $m$ and $\gamma$, by the rule
$n_0 = n'_0 = N_0 = 0$ and

(9.7) $n_{p+1} = (N_p + 1)\bar{m} + N_p$,
      $n'_{p+1} = \mathrm{CS}(k+1, n_{p+1}, \bar{m}, \bar{m}+1)$,
      $N_{p+1} = \mathrm{CS}(k+1, \max\{n'_{p+1}, M\}, \bar{m}, 2)$.

We are mainly interested in the sequence $(N_p)$. The sequences $(n_p)$ and
$(n'_p)$ are auxiliary ones which will be used in the proof of the following
lemma.

Lemma 9.1. Let $k \in \mathbb{N}$ with $k \geq 2$ and assume that for every
integer $\ell \geq 1$ and every $0 < \beta \leq 1$ the number
$\mathrm{DCS}(k, \ell, \beta)$ has been defined.

Let $0 < \gamma, \delta \leq 1$ and $r \in [k]$. Also let $V$ be a
Carlson-Simpson tree of $[k+1]^{<\mathbb{N}}$ and $I$ be a nonempty subset of
$\{0, ..., \dim(V)\}$. Assume that we are given subsets $A, D_r, ..., D_k$ of
$[k+1]^{<\mathbb{N}}$ with the following properties.

(a) The set $D_r$ is $(r, k+1)$-insensitive in $V$.

(b) We have
$\mathrm{dens}_{V(i)}(D_r \cap ... \cap D_k \cap A) \geq (\delta + 2\gamma)\,\mathrm{dens}_{V(i)}(D_r \cap ... \cap D_k)$
and $\mathrm{dens}_{V(i)}(D_r \cap ... \cap D_k) \geq 2\gamma$ for every
$i \in I$.

Finally, let $m \in \mathbb{N}$ with $m \geq 1$ and suppose that

(9.8) $|I| \geq \mathrm{Reg}(k+1, N_{p_0} + 1, 2, \gamma^2/2)$

where $p_0$ and $N_{p_0}$ are defined in (9.6) and (9.7) respectively for the
parameters $m$ and $\gamma$. Then there exist a Carlson-Simpson subtree $W$ of
$V$ and a subset $I'$ of $\{0, ..., \dim(W)\}$ of cardinality $m$ with the
following properties. If $r < k$, then

(9.9) $\mathrm{dens}_{W(i)}(D_{r+1} \cap ... \cap D_k \cap A) \geq (\delta + \gamma/2)\,\mathrm{dens}_{W(i)}(D_{r+1} \cap ... \cap D_k)$

and

(9.10) $\mathrm{dens}_{W(i)}(D_{r+1} \cap ... \cap D_k) \geq \gamma^3/256$

for every $i \in I'$. On the other hand if $r = k$, then

(9.11) $\mathrm{dens}_{W(i)}(A) \geq \delta + \gamma/2$

for every $i \in I'$.

The proof of Lemma 9.1 will be given in §9.3. As the reader might have already
guessed, Lemma 9.1 is the main result of this section and incorporates the
exhaustion procedure outlined in §9.1. It will be used in §9.4 where we shall
achieve the density increment.

9.3. Proof of Lemma 9.1. The first step of the proof relies on an application
of the regularity lemma presented in §3. Precisely, let
$B = D_{r+1} \cap ... \cap D_k$ if $r < k$; otherwise, let $B = V$. Identifying
the Carlson-Simpson tree $V$ with $[k+1]^{<\dim(V)+1}$ via the canonical
isomorphism $\mathrm{I}_V$ (see §2.5), we may apply Lemma 3.2 to the family
$\mathcal{F} = \{D_r \cap B, D_r \cap B \cap A\}$ and we get a subset $L$ of $I$
of cardinality $N_{p_0} + 1$ such that $\mathcal{F}$ is
$(\gamma^2/2, L)$-regular. We set

(9.12) $A^0 = \mathrm{c}_{L,V}^{-1}(A)$, $B^0 = \mathrm{c}_{L,V}^{-1}(B)$ and $D^0 = \mathrm{c}_{L,V}^{-1}(D_r)$.

The fact that $\mathcal{F}$ is $(\gamma^2/2, L)$-regular and conditions (a) and
(b) in the statement of the lemma have some consequences which are isolated in
the following fact. Its proof is similar to the proof of Lemma 7.2 and is left
to the reader.

Fact 9.2. For every $t \in [k+1]^{<|L|}$ let $A_t^0$, $B_t^0$ and $D_t^0$ be the
sections at $t$ of $A^0$, $B^0$ and $D^0$ respectively. Then for every
$t, t' \in [k+1]^{<|L|}$ the following hold.

(i) If $t, t'$ are $(r, k+1)$-equivalent (see §2.6), then $D_t^0$ and
$D_{t'}^0$ coincide.

(ii) We have
$\mathrm{dens}_{X_L}(D_t^0 \cap B_t^0 \cap A_t^0) \geq (\delta + \gamma)\,\mathrm{dens}_{X_L}(D_t^0 \cap B_t^0)$.

(iii) We have $\mathrm{dens}_{X_L}(D_t^0 \cap B_t^0) \geq \gamma$.

We are ready to proceed to the main part of the proof. We will argue by
contradiction. In particular, assuming that the lemma is not satisfied, we
shall determine an integer $d \in [p_0]$ and we shall construct

(1) a $(k+1)$-compatible pair $((L_n)_{n=0}^{d}, (V_n)_{n=0}^{d})$ with
$L_0 = L$ and $V_0 = V$, and

(2) for every $p \in [d]$, every $\ell \in [p]$ and every
$s \in [k+1]^{<|L_p|}$ a family $\mathcal{Q}_s^{\ell,p}$ of subsets of
$X_{\mathbf{L}_p}$, where $\mathbf{L}_p = (L_n)_{n=0}^{p}$ and
$\mathbf{V}_p = (V_n)_{n=0}^{p}$.

The construction is done recursively so that, setting

(9.13) $A^p = \mathrm{c}_{\mathbf{L}_p,\mathbf{V}_p}^{-1}(A)$, $B^p = \mathrm{c}_{\mathbf{L}_p,\mathbf{V}_p}^{-1}(B)$ and $D^p = \mathrm{c}_{\mathbf{L}_p,\mathbf{V}_p}^{-1}(D_r)$,

for every $p \in [d]$ the following conditions are satisfied.

(C1) The set $L_p$ has cardinality $N_{p_0 - p} + 1$.

(C2) For every $\ell \in [p]$ and every $s \in [k+1]^{<|L_p|}$ the family
$\mathcal{Q}_s^{\ell,p}$ consists of pairwise disjoint subsets of the section
$D_s^p$ of $D^p$ at $s$.

(C3) For every $s \in [k+1]^{<|L_p|}$ the sets
$\bigcup\mathcal{Q}_s^{1,p}, ..., \bigcup\mathcal{Q}_s^{p,p}$ are pairwise
disjoint.

(C4) For every $\ell \in [p]$, every pair $s, s' \in [k+1]^{<|L_p|}$ with the
same length and every $Q \in \mathcal{Q}_s^{\ell,p}$ and
$Q' \in \mathcal{Q}_{s'}^{\ell,p}$ we have
$\mathrm{dens}_{X_{\mathbf{L}_p}}(Q) = \mathrm{dens}_{X_{\mathbf{L}_p}}(Q')$.

(C5) For every $\ell \in [p]$ and every $s \in [k+1]^{<|L_p|}$ we say that an
element $Q$ of $\mathcal{Q}_s^{\ell,p}$ is good provided that
$\mathrm{dens}_Q(D_s^p \cap B_s^p \cap A_s^p) \geq (\delta + \gamma/2)\,\mathrm{dens}_Q(D_s^p \cap B_s^p)$
and $\mathrm{dens}_Q(D_s^p \cap B_s^p) \geq \gamma^3/256$. Then, setting

(9.14) $\mathcal{G}_s^{\ell,p} = \{Q \in \mathcal{Q}_s^{\ell,p} : Q \text{ is good}\}$

we have

(9.15) $\frac{|\mathcal{G}_s^{\ell,p}|}{|\mathcal{Q}_s^{\ell,p}|} < \frac{\gamma^3}{256}$.

(C6) For every $\ell \in [p]$ and every $s \in [k+1]^{<|L_p|}$ we have
$\mathrm{dens}_{X_{\mathbf{L}_p}}(\bigcup\mathcal{Q}_s^{\ell,p}) \geq \alpha$
where $\alpha$ is as in (9.6).

(C7) For every $\ell \in [p]$ and every $s, s' \in [k+1]^{<|L_p|}$ we have
$\mathcal{Q}_s^{\ell,p} = \mathcal{Q}_{s'}^{\ell,p}$ if $s$ and $s'$ are
$(r, k+1)$-equivalent.

(C8) If $p = d$, then there exists $s_0 \in [k+1]^{<|L_d|}$ such that

(9.16) $\mathrm{dens}_{X_{\mathbf{L}_d}}\Big(D_{s_0}^d \setminus \bigcup_{\ell=1}^{d} \bigcup\mathcal{Q}_{s_0}^{\ell,d}\Big) < \gamma^2/8$.

Assuming that the above construction has been carried out, let us derive the
contradiction. Let $s_0$ be as in (C8). By Corollary 6.8, Corollary 6.9 and
Fact 9.2, we see that

(9.17) $\mathrm{dens}_{X_{\mathbf{L}_d}}(D_{s_0}^d \cap B_{s_0}^d \cap A_{s_0}^d) \geq (\delta + \gamma)\,\mathrm{dens}_{X_{\mathbf{L}_d}}(D_{s_0}^d \cap B_{s_0}^d)$

and

(9.18) $\mathrm{dens}_{X_{\mathbf{L}_d}}(D_{s_0}^d \cap B_{s_0}^d) \geq \gamma$.

For every $\ell \in [d]$ we set $C_\ell = \bigcup\mathcal{Q}_{s_0}^{\ell,d}$. By
(9.16), the family $\{C_\ell : \ell \in [d]\}$ is an "almost cover" of
$D_{s_0}^d \cap B_{s_0}^d$. Hence, invoking (9.17) and (9.18) and applying
Lemma 2.3 for "$\varepsilon = \gamma/4$", we may find $\ell_0 \in [d]$ such that

(9.19) $\mathrm{dens}_{C_{\ell_0}}(D_{s_0}^d \cap B_{s_0}^d \cap A_{s_0}^d) \geq (\delta + 3\gamma/4)\,\mathrm{dens}_{C_{\ell_0}}(D_{s_0}^d \cap B_{s_0}^d)$

and

(9.20) $\mathrm{dens}_{C_{\ell_0}}(D_{s_0}^d \cap B_{s_0}^d) \geq \gamma^2/16$.

Next observe that, by conditions (C2) and (C4), the family
$\mathcal{Q}_{s_0}^{\ell_0,d}$ is a partition of $C_{\ell_0}$ into sets of equal
size. Taking into account this observation and the estimates in (9.19) and
(9.20), by a second application of Lemma 2.3 for "$\varepsilon = \gamma/4$", we
conclude that

(9.21) $\frac{|\mathcal{G}_{s_0}^{\ell_0,d}|}{|\mathcal{Q}_{s_0}^{\ell_0,d}|} \geq \frac{\gamma^3}{256}$.

This contradicts (9.15), as desired.

The rest of the proof is devoted to the description of the recursive
construction. For "$p = 0$" we set $L_0 = L$ and $V_0 = V$. Let
$p \in \{0, ..., p_0\}$ and assume that the construction has been carried out up
to $p$ so that conditions (C1)-(C8) are satisfied. We distinguish the following
cases.

Case 1: $p = p_0$. Notice first that, by (C1), the set $L_p$ is a singleton.
Therefore, the set $[k+1]^{<|L_p|}$ is the singleton $\{\emptyset\}$. We set
$s_0 = \emptyset$ and $d = p_0$. With these choices, the recursive construction
will be completed once we show that the estimate in (9.16) is satisfied. This
is, however, an immediate consequence of conditions (C3) and (C6) and the
choice of $\alpha$ and $p_0$ in (9.6).

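Case 2: there exists $s \in [k+1]^{<|L_p|}$ such that

(9.22) $\mathrm{dens}_{X_{\mathbf{L}_p}}\Big(D_s^p \setminus \bigcup_{\ell=1}^{p} \bigcup\mathcal{Q}_s^{\ell,p}\Big) < \gamma^2/8$.

In this case we set $d = p$ and we terminate the construction.

Case 3: we have that $p < p_0$ and

(9.23) $\mathrm{dens}_{X_{\mathbf{L}_p}}\Big(D_s^p \setminus \bigcup_{\ell=1}^{p} \bigcup\mathcal{Q}_s^{\ell,p}\Big) \geq \gamma^2/8$

for every $s \in [k+1]^{<|L_p|}$. If $p \geq 1$, then for every
$s \in [k+1]^{<|L_p|}$ we set

(9.24) $\Gamma_s = D_s^p \setminus \bigcup_{\ell=1}^{p} \bigcup\mathcal{Q}_s^{\ell,p}$.

Otherwise, let $\Gamma_s = D_s^0$. The following fact follows by (9.23) and
condition (C7) if $p \geq 1$, and by Fact 9.2 if $p = 0$.

Fact 9.3. For every $s \in [k+1]^{<|L_p|}$ we have
$\mathrm{dens}_{X_{\mathbf{L}_p}}(\Gamma_s) \geq \gamma^2/8$. Moreover, if
$s, s' \in [k+1]^{<|L_p|}$ are $(r, k+1)$-equivalent, then
$\Gamma_s = \Gamma_{s'}$.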

By Fact 9.3, condition (C1), the choice of the sequence $(N_p)$ in (9.7) and
the choice of $\alpha$ in (9.6), we may apply Corollary 7.11 to get a
Carlson-Simpson tree $S$ of $[k+1]^{<|L_p|}$ with $\dim(S) = n'_{p_0 - p}$ such
that for every $\bar{m}$-dimensional Carlson-Simpson subtree $U$ of $S$, setting

(9.25) $\Gamma_U = \bigcap_{s \in U} \Gamma_s$

we have $\mathrm{dens}_{X_{\mathbf{L}_p}}(\Gamma_U) \geq \alpha$.

For every $i \in \{0, ..., \bar{m}\}$ and every Carlson-Simpson subtree $U$ of
$S$ of dimension $\bar{m}$ let $G_{i,U}$ be the set of all
$\mathbf{x} \in \Gamma_U$ satisfying

(P1) $\mathrm{dens}_{U(i) \times \{\mathbf{x}\}}(D^p \cap B^p \cap A^p) \geq (\delta + \gamma/2)\,\mathrm{dens}_{U(i) \times \{\mathbf{x}\}}(D^p \cap B^p)$ and

(P2) $\mathrm{dens}_{U(i) \times \{\mathbf{x}\}}(D^p \cap B^p) \geq \gamma^3/256$.

Our assumption that the lemma is not satisfied reduces to the following
property of the sets $G_{i,U}$.

Claim 9.4. For every $\bar{m}$-dimensional Carlson-Simpson subtree $U$ of $S$
there exists $i \in \{0, ..., \bar{m}\}$ such that
$\mathrm{dens}_{\Gamma_U}(G_{i,U}) < \gamma^3/256$.

Proof. We will argue by contradiction. So, assume that there exists a
Carlson-Simpson subtree $U$ of $S$ of dimension $\bar{m}$ such that for every
$i \in \{0, ..., \bar{m}\}$ we have
$\mathrm{dens}_{\Gamma_U}(G_{i,U}) \geq \gamma^3/256$. For every
$\mathbf{x} \in \Gamma_U$ let

(9.26) $I_{\mathbf{x}} = \{i \in \{0, ..., \bar{m}\} : \mathbf{x} \in G_{i,U}\}$.

By Lemma 2.3, setting
$G = \{\mathbf{x} \in \Gamma_U : |I_{\mathbf{x}}| \geq (\bar{m}+1)\gamma^3/512\}$,
we have that $\mathrm{dens}_{\Gamma_U}(G) \geq \gamma^3/512$. This implies, in
particular, that the set $G$ is nonempty. We select $\mathbf{x} \in G$. By the
choice of $\bar{m}$ in (9.4) we have that $|I_{\mathbf{x}}| \geq m$. We define
$W = \{\mathrm{c}_{\mathbf{L}_p,\mathbf{V}_p}(s, \mathbf{x}) : s \in U\}$. By
Lemma 6.4, $W$ is a Carlson-Simpson subtree of $V$ of dimension $\bar{m}$.
Applying Lemma 6.5 twice for the sets $D_r \cap B \cap A$ and $D_r \cap B$, for
every $i \in \{0, ..., \bar{m}\}$ we have

(9.27) $\mathrm{dens}_{W(i)}(D_r \cap B \cap A) = \mathrm{dens}_{U(i) \times \{\mathbf{x}\}}(D^p \cap B^p \cap A^p)$

and

(9.28) $\mathrm{dens}_{W(i)}(D_r \cap B) = \mathrm{dens}_{U(i) \times \{\mathbf{x}\}}(D^p \cap B^p)$.

The above equalities and the fact that $\mathbf{x} \in G_{i,U}$ for every
$i \in I_{\mathbf{x}}$ yield that

(9.29) $\mathrm{dens}_{W(i)}(D_r \cap B \cap A) \geq (\delta + \gamma/2)\,\mathrm{dens}_{W(i)}(D_r \cap B)$

and

(9.30) $\mathrm{dens}_{W(i)}(D_r \cap B) \geq \gamma^3/256$

for every $i \in I_{\mathbf{x}}$. Finally, observe that $W$ is a subset of $D_r$
since $\mathbf{x} \in \Gamma_U$. It is then clear that $W$ and $I_{\mathbf{x}}$
satisfy the conclusion of the lemma, in contradiction with our assumption. □
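The averaging step in the proof of Claim 9.4 (from "every $G_{i,U}$ is reasonably dense" to "some point lies in many of the $G_{i,U}$") can be illustrated on finite sets. The sketch below is not the statement of Lemma 2.3, which is not reproduced in this section; it only encodes the elementary fact that if each of finitely many subsets of a finite set has density at least $\varepsilon$, then the points lying in at least an $\varepsilon/2$ fraction of the subsets have density at least $\varepsilon/2$. In the proof this is applied with $\varepsilon = \gamma^3/256$. The function names are hypothetical.

```python
from fractions import Fraction


def index_sets(sets, universe):
    """For each x in the universe, the set I_x = {i : x belongs to sets[i]}."""
    return {x: {i for i, G in enumerate(sets) if x in G} for x in universe}


def rich_points(sets, universe, eps):
    """Points x whose index set I_x has at least (eps / 2) * len(sets) elements."""
    threshold = Fraction(eps) / 2 * len(sets)
    I = index_sets(sets, universe)
    return {x for x in universe if len(I[x]) >= threshold}
```

Exact rational arithmetic via `Fraction` is used so that the threshold comparison is not affected by floating-point rounding.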
We are now in the position to start the process of selecting the new objects of
the recursive construction.

Step 1: selection of $V_{p+1}$ and $L_{p+1}$. Firstly, we will use a coloring
argument to control the integer $i$ obtained by Claim 9.4. Specifically, by the
choice of the sequence $(n'_p)$ in (9.7) and Claim 9.4, we may apply
Theorem 4.1 to obtain $i_0 \in \{0, ..., \bar{m}\}$ and a Carlson-Simpson
subtree $T$ of $S$ of dimension $n_{p_0 - p}$ such that for every
$\bar{m}$-dimensional Carlson-Simpson subtree $U$ of $T$ we have
$\mathrm{dens}_{\Gamma_U}(G_{i_0,U}) < \gamma^3/256$. We define

(9.31) $V_{p+1} = T$ and $L_{p+1} = \{i_0 + j(i_0 + 1) : j \in \{0, ..., N_{p_0 - (p+1)}\}\}$.

Notice that the pair $((L_n)_{n=0}^{p+1}, (V_n)_{n=0}^{p+1})$ is
$(k+1)$-compatible. This follows by our inductive assumptions and the choice of
the sequence $(n_p)$ in (9.7). Moreover, the cardinality of the set $L_{p+1}$
is $N_{p_0 - (p+1)} + 1$. Hence, with these choices, condition (C1) is
satisfied. For notational simplicity, in what follows by $\mathrm{q}_{p+1}$ we
shall denote the quotient map associated to the pair $(L_{p+1}, V_{p+1})$
defined in (6.5).

Step 2: selection of the families $\mathcal{Q}_t^{p+1,p+1}$. This is the most
important part of the recursive selection. The members of the families
$\mathcal{Q}_t^{p+1,p+1}$ are, essentially, the sets $\Gamma_U$ where $U$ ranges
over all $\bar{m}$-dimensional Carlson-Simpson subtrees of $V_{p+1}$. However,
in order to carry out the construction, we have to group them in a canonical
way. We proceed to the details.

Consider the canonical isomorphism
$\mathrm{I}_{V_{p+1}} : [k+1]^{<\dim(V_{p+1})+1} \to V_{p+1}$ defined in §2.5;
for convenience it will be denoted by $\mathrm{I}$. For every
$j \in \{0, ..., |L_{p+1}| - 1\}$ and every $t \in [k+1]^{j}$ we set

(9.32) $\Omega_t = \{\mathrm{c}_{L_{p+1},V_{p+1}}(t, x) : x \in X_{L_{p+1}}\} \subseteq V_{p+1}(i_0 + j(i_0 + 1))$.

If $j \geq 1$, then let $t^*$ be the unique initial segment of $t$ of length
$j - 1$ and set

(9.33) $K_t = \{\mathrm{I}^{-1}(w)^\frown t(j-1) : w \in \Omega_{t^*}\} \subseteq [k+1]^{j(i_0+1)}$;

otherwise, let $K_t = \{\emptyset\}$. Notice that
$K_t = \{\mathrm{I}^{-1}(\mathrm{c}_{L_{p+1},V_{p+1}}(t^*, x))^\frown t(j-1) : x \in X_{L_{p+1}}\}$.
Finally for every $s \in K_t$ let $C_s = s^\frown [k+1]^{<\bar{m}+1}$ and set

(9.34) $\mathcal{V}_t = \{\mathrm{I}(C_s) : s \in K_t\}$.

Before we analyze the above definitions, let us give a specific example. For
concreteness take $k = 3$ and assume, for notational simplicity, that $V_{p+1}$
is of the form $[4]^{<n}$ where $n$ is large enough compared to $i_0$ (hence,
the map $\mathrm{I}$ is the identity). Consider the sequence $t = (1, 2, 1)$ and
observe that $t^* = (1, 2)$. Notice that $\Omega_t$ is the subset of
$[4]^{4i_0+3}$ consisting of all finite sequences $x$ such that
$x(i_0) = t(0) = 1$, $x(2i_0 + 1) = t(1) = 2$ and $x(3i_0 + 2) = t(2) = 1$. On
the other hand, the set $K_t$ is the subset of $[4]^{3i_0+3}$ consisting of all
sequences $x$ such that $x(i_0) = t(0) = 1$, $x(2i_0 + 1) = t(1) = 2$ and
$x(3i_0 + 2) = t(2) = 1$. It is then easy to see that in this specific case the
family $\{U(i_0) : U \in \mathcal{V}_t\}$ forms a partition of the set
$\Omega_t$. This is, actually, a general property as is shown in the following
fact.
Fact 9.5. Let $t \in [k+1]^{<|L_{p+1}|}$ be arbitrary. Then the family
$\mathcal{V}_t$ consists of pairwise disjoint $\bar{m}$-dimensional
Carlson-Simpson subtrees of $V_{p+1}$. Moreover,

(9.35) $\Omega_t = \bigcup_{U \in \mathcal{V}_t} U(i_0)$.

Proof. It is clear that the family $\mathcal{V}_t$ consists of pairwise disjoint
Carlson-Simpson trees. Moreover, by the choice of $L_{p+1}$ in (9.31), we have

(9.36) $\mathrm{I}^{-1}(\Omega_t) = \{s^\frown y : s \in K_t \text{ and } y \in [k+1]^{i_0}\} = \bigcup_{s \in K_t} C_s(i_0) = \bigcup_{U \in \mathcal{V}_t} \mathrm{I}^{-1}(U(i_0))$

and the proof is completed. □
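The partition property (9.35) can be checked by brute force in the simplified setting of the example with $t = (1, 2, 1)$, that is, when $V_{p+1} = [4]^{<n}$ and $\mathrm{I}$ is the identity. The sketch below works under exactly that assumption; words are tuples indexed from position 0, and the function names are hypothetical.

```python
from itertools import product


def omega(t, i0, alphabet):
    """Words x of length i0 + |t|(i0 + 1) with x(i0 + j(i0 + 1)) = t(j), j < |t|."""
    length = i0 + len(t) * (i0 + 1)
    fixed = {i0 + j * (i0 + 1): t[j] for j in range(len(t))}
    return {x for x in product(alphabet, repeat=length)
            if all(x[pos] == letter for pos, letter in fixed.items())}


def k_set(t, i0, alphabet):
    """The set K_t of (9.33): words of Omega_{t*} followed by the last letter of t."""
    if not t:
        return {()}
    return {w + (t[-1],) for w in omega(t[:-1], i0, alphabet)}


def level_blocks(t, i0, alphabet):
    """The sets U(i0) for U in V_t, i.e. s followed by all words of length i0."""
    return [{s + y for y in product(alphabet, repeat=i0)}
            for s in k_set(t, i0, alphabet)]
```

With `i0 = 1` and `alphabet = (1, 2, 3, 4)`, the call `level_blocks((1, 2, 1), 1, alphabet)` returns 64 blocks of 4 words each whose disjoint union is `omega((1, 2, 1), 1, alphabet)`.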

We record, for future use, another property of the family $\mathcal{V}_t$.

Fact 9.6. Let $t, t' \in [k+1]^{<|L_{p+1}|}$ with the same length. Then for
every $U \in \mathcal{V}_t$ and every $U' \in \mathcal{V}_{t'}$ we have
$\mathrm{dens}_{\Omega_t}(U(i_0)) = \mathrm{dens}_{\Omega_{t'}}(U'(i_0))$.

Proof. By Fact 9.5, we have $U(i_0) \subseteq \Omega_t$ and
$U'(i_0) \subseteq \Omega_{t'}$ while, by Fact 5.2, we have
$|\Omega_t| = |\Omega_{t'}|$. Noticing that
$|U(i_0)| = |U'(i_0)| = (k+1)^{i_0}$ the result follows. □

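We are ready to define the families $\mathcal{Q}_t^{p+1,p+1}$. Specifically, fix
$t \in [k+1]^{<|L_{p+1}|}$. For every $U \in \mathcal{V}_t$ and every
$\mathbf{x} \in \Gamma_U$ let

(9.37) $Q_t^{\mathbf{x},U} = \{\mathbf{x}\} \times \{x \in X_{L_{p+1}} : \mathrm{c}_{L_{p+1},V_{p+1}}(t, x) \in U(i_0)\} \subseteq X_{\mathbf{L}_{p+1}}$.

By Fact 9.5, the set $U(i_0)$ is contained in $\Omega_t$. Hence,

(9.38) $\{t\} \times Q_t^{\mathbf{x},U} = \mathrm{q}_{p+1}^{-1}(U(i_0) \times \{\mathbf{x}\})$.

We define

(9.39) $\mathcal{Q}_t^{p+1,p+1} = \{Q_t^{\mathbf{x},U} : U \in \mathcal{V}_t \text{ and } \mathbf{x} \in \Gamma_U\}$.

This completes the second step of the recursive selection.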

Step 3: selection of the families $\mathcal{Q}_t^{\ell,p+1}$ for every
$\ell \in [p]$. In this step we will not introduce anything new but rather
"copy" into the space $X_{\mathbf{L}_{p+1}}$ what we have constructed so far.
In particular, this step is meaningful only if $p \geq 1$.

Specifically, let $p \geq 1$ and fix $t \in [k+1]^{<|L_{p+1}|}$ and
$\ell \in [p]$. For every $s \in \Omega_t$ and every
$Q \in \mathcal{Q}_s^{\ell,p}$ let

(9.40) $C_t^{s,\ell,Q} = Q \times \{x \in X_{L_{p+1}} : \mathrm{c}_{L_{p+1},V_{p+1}}(t, x) = s\} \subseteq X_{\mathbf{L}_{p+1}}$.

Notice that

(9.41) $\{t\} \times C_t^{s,\ell,Q} = \mathrm{q}_{p+1}^{-1}(\{s\} \times Q)$.

We define

(9.42) $\mathcal{Q}_t^{\ell,p+1} = \{C_t^{s,\ell,Q} : s \in \Omega_t \text{ and } Q \in \mathcal{Q}_s^{\ell,p}\}$.

This completes the third, and final, step of the recursive selection.
Step 4: verification of the inductive assumptions. Recall that condition (C1)
has already been verified in Step 1. Also observe that condition (C8) is
meaningless in this case. Conditions (C2) up to (C7) will be verified in the
following claims.

Claim 9.7. For every $\ell \in [p+1]$ and every $t \in [k+1]^{<|L_{p+1}|}$ the
family $\mathcal{Q}_t^{\ell,p+1}$ consists of pairwise disjoint subsets of
$D_t^{p+1}$. That is, condition (C2) is satisfied.

Proof. Fix $t \in [k+1]^{<|L_{p+1}|}$. First assume that $\ell \in [p]$.
Invoking the definition of the family $\mathcal{Q}_t^{\ell,p+1}$ in (9.40) and
(9.42) together with our inductive assumptions, we see that the family
$\mathcal{Q}_t^{\ell,p+1}$ consists of pairwise disjoint sets. Fix
$C_t^{s,\ell,Q} \in \mathcal{Q}_t^{\ell,p+1}$ for some $s \in \Omega_t$ and
$Q \in \mathcal{Q}_s^{\ell,p}$. Using our inductive assumptions once again, we
see that $Q \subseteq D_s^p$ or equivalently $\{s\} \times Q \subseteq D^p$.
Hence, by (9.41) and Fact 6.3,

(9.43) $\{t\} \times C_t^{s,\ell,Q} \subseteq \mathrm{q}_{p+1}^{-1}(D^p) = D^{p+1}$.

Now we treat the case "$\ell = p+1$". Let $U, U' \in \mathcal{V}_t$,
$\mathbf{x} \in \Gamma_U$ and $\mathbf{x}' \in \Gamma_{U'}$. We will show that
the sets $Q_t^{\mathbf{x},U}$ and $Q_t^{\mathbf{x}',U'}$ are disjoint provided
that $(U, \mathbf{x}) \neq (U', \mathbf{x}')$. Indeed, if $U = U'$, then
necessarily $\mathbf{x} \neq \mathbf{x}'$ which implies that the sets
$Q_t^{\mathbf{x},U}$ and $Q_t^{\mathbf{x}',U}$ are disjoint. Otherwise, by
Fact 9.5, the Carlson-Simpson trees $U$ and $U'$ are disjoint. In particular,
the sets $U(i_0)$ and $U'(i_0)$ are disjoint yielding that
$Q_t^{\mathbf{x},U} \cap Q_t^{\mathbf{x}',U'} = \emptyset$. What remains is to
check that $Q_t^{\mathbf{x},U} \subseteq D_t^{p+1}$ for every
$U \in \mathcal{V}_t$ and every $\mathbf{x} \in \Gamma_U$. So, fix
$U \in \mathcal{V}_t$ and $\mathbf{x} \in \Gamma_U$. By (9.38) and Fact 6.3 we
conclude that

(9.44) $\{t\} \times Q_t^{\mathbf{x},U} \subseteq \mathrm{q}_{p+1}^{-1}(D^p) = D^{p+1}$

and the proof is completed. □

Claim 9.8. For every $t \in [k+1]^{<|L_{p+1}|}$ the sets
$\bigcup\mathcal{Q}_t^{1,p+1}, ..., \bigcup\mathcal{Q}_t^{p+1,p+1}$ are pairwise
disjoint. That is, condition (C3) is satisfied.

Proof. Fix $t \in [k+1]^{<|L_{p+1}|}$ and $\ell \in [p]$. Let
$\ell' \in [p+1]$ with $\ell' \neq \ell$. We need to show that the sets
$\bigcup\mathcal{Q}_t^{\ell,p+1}$ and $\bigcup\mathcal{Q}_t^{\ell',p+1}$ are
disjoint. If $\ell' \leq p$, then this follows immediately from (9.40) and our
inductive assumptions. So, assume that $\ell' = p+1$ and let
$U \in \mathcal{V}_t$ and $\mathbf{x} \in \Gamma_U$. By (9.24) and (9.25), we
have that $\mathbf{x} \notin \bigcup\mathcal{Q}_s^{\ell,p}$ for every
$s \in U$. Using this observation, the result follows from (9.38) and (9.41). □

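Claim 9.9. For every $\ell \in [p+1]$, every $t, t' \in [k+1]^{<|L_{p+1}|}$ with
the same length and every $\Theta \in \mathcal{Q}_t^{\ell,p+1}$ and
$\Theta' \in \mathcal{Q}_{t'}^{\ell,p+1}$ we have
$\mathrm{dens}_{X_{\mathbf{L}_{p+1}}}(\Theta) = \mathrm{dens}_{X_{\mathbf{L}_{p+1}}}(\Theta')$.
That is, condition (C4) is satisfied.

Proof. If $\ell \in [p]$, then by (9.41) there exist $s \in \Omega_t$ and
$s' \in \Omega_{t'}$ as well as $Q \in \mathcal{Q}_s^{\ell,p}$ and
$Q' \in \mathcal{Q}_{s'}^{\ell,p}$ such that
$\{t\} \times \Theta = \mathrm{q}_{p+1}^{-1}(\{s\} \times Q)$ and
$\{t'\} \times \Theta' = \mathrm{q}_{p+1}^{-1}(\{s'\} \times Q')$. By Fact 5.2,
we have that $s$ and $s'$ have the same length and
$|\Omega_t| = |\Omega_{t'}|$. Therefore, by our inductive assumptions,

(9.45) $\mathrm{dens}_{\Omega_t \times X_{\mathbf{L}_p}}(\{s\} \times Q) = \frac{1}{|\Omega_t|}\,\mathrm{dens}_{\{s\} \times X_{\mathbf{L}_p}}(\{s\} \times Q)$
       $= \frac{1}{|\Omega_t|}\,\mathrm{dens}_{X_{\mathbf{L}_p}}(Q) = \frac{1}{|\Omega_{t'}|}\,\mathrm{dens}_{X_{\mathbf{L}_p}}(Q')$
       $= \frac{1}{|\Omega_{t'}|}\,\mathrm{dens}_{\{s'\} \times X_{\mathbf{L}_p}}(\{s'\} \times Q')$
       $= \mathrm{dens}_{\Omega_{t'} \times X_{\mathbf{L}_p}}(\{s'\} \times Q')$.

Applying Lemma 6.6 we conclude that

(9.46) $\mathrm{dens}_{X_{\mathbf{L}_{p+1}}}(\Theta) = \mathrm{dens}_{\{t\} \times X_{\mathbf{L}_{p+1}}}(\{t\} \times \Theta)$
       $= \mathrm{dens}_{\Omega_t \times X_{\mathbf{L}_p}}(\{s\} \times Q)$
       $= \mathrm{dens}_{\Omega_{t'} \times X_{\mathbf{L}_p}}(\{s'\} \times Q')$
       $= \mathrm{dens}_{\{t'\} \times X_{\mathbf{L}_{p+1}}}(\{t'\} \times \Theta') = \mathrm{dens}_{X_{\mathbf{L}_{p+1}}}(\Theta')$.

If $\ell = p+1$, then by (9.38) there exist $U \in \mathcal{V}_t$ and
$U' \in \mathcal{V}_{t'}$ as well as $\mathbf{x} \in \Gamma_U$ and
$\mathbf{x}' \in \Gamma_{U'}$ such that
$\{t\} \times \Theta = \mathrm{q}_{p+1}^{-1}(U(i_0) \times \{\mathbf{x}\})$ and
$\{t'\} \times \Theta' = \mathrm{q}_{p+1}^{-1}(U'(i_0) \times \{\mathbf{x}'\})$.
By Fact 9.6, we have

(9.47) $\mathrm{dens}_{\Omega_t \times X_{\mathbf{L}_p}}(U(i_0) \times \{\mathbf{x}\}) = \frac{1}{|X_{\mathbf{L}_p}|}\,\mathrm{dens}_{\Omega_t}(U(i_0))$
       $= \frac{1}{|X_{\mathbf{L}_p}|}\,\mathrm{dens}_{\Omega_{t'}}(U'(i_0))$
       $= \mathrm{dens}_{\Omega_{t'} \times X_{\mathbf{L}_p}}(U'(i_0) \times \{\mathbf{x}'\})$.

Using (9.47), Lemma 6.6 and arguing precisely as in the previous case, we see
that
$\mathrm{dens}_{X_{\mathbf{L}_{p+1}}}(\Theta) = \mathrm{dens}_{X_{\mathbf{L}_{p+1}}}(\Theta')$
and the proof is completed. □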

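Claim 9.10. For every $\ell \in [p+1]$ and every $t \in [k+1]^{<|L_{p+1}|}$ we
have

(9.48) $\frac{|\mathcal{G}_t^{\ell,p+1}|}{|\mathcal{Q}_t^{\ell,p+1}|} < \frac{\gamma^3}{256}$.

That is, condition (C5) is satisfied.

Proof. Fix $t \in [k+1]^{<|L_{p+1}|}$. Assume first that $\ell \in [p]$. For
every $s \in \Omega_t$ let

(9.49) $\mathcal{Q}_s = \{C_t^{s,\ell,Q} : Q \in \mathcal{Q}_s^{\ell,p}\}$ and $\mathcal{G}_s = \mathcal{G}_t^{\ell,p+1} \cap \mathcal{Q}_s$.

By (9.42), the family $\{\mathcal{Q}_s : s \in \Omega_t\}$ is a partition of
$\mathcal{Q}_t^{\ell,p+1}$. The family $\{\mathcal{G}_s : s \in \Omega_t\}$ is
the induced partition of $\mathcal{G}_t^{\ell,p+1}$. Moreover, for every
$s \in \Omega_t$ let

(9.50) $\mathcal{B}_s = \{C_t^{s,\ell,Q} : Q \in \mathcal{G}_s^{\ell,p}\}$.

Subclaim 9.11. For every $s \in \Omega_t$ we have
$\mathcal{G}_s = \mathcal{B}_s$.

Proof of Subclaim 9.11. Fix $s \in \Omega_t$ and let
$\Theta \in \mathcal{Q}_s$ be arbitrary. By (9.41), the map
$\mathcal{Q}_s^{\ell,p} \ni Q \mapsto C_t^{s,\ell,Q} \in \mathcal{Q}_s$ is a
bijection. Hence, there exists a unique $Q \in \mathcal{Q}_s^{\ell,p}$ such that
$\Theta = C_t^{s,\ell,Q}$. Using (9.41) once again, we see that
$\{t\} \times \Theta = \mathrm{q}_{p+1}^{-1}(\{s\} \times Q)$. Since
$s \in \Omega_t$, by Corollary 6.10 we get

(9.51) $\mathrm{dens}_{\Theta}(D_t^{p+1} \cap B_t^{p+1}) = \mathrm{dens}_{\{t\} \times \Theta}(D^{p+1} \cap B^{p+1})$
       $= \mathrm{dens}_{\mathrm{q}_{p+1}^{-1}(\{s\} \times Q)}(\mathrm{q}_{p+1}^{-1}(D^p \cap B^p))$
       $= \mathrm{dens}_{\{s\} \times Q}(D^p \cap B^p)$
       $= \mathrm{dens}_Q(D_s^p \cap B_s^p)$.

Similarly,

(9.52) $\mathrm{dens}_{\Theta}(D_t^{p+1} \cap B_t^{p+1} \cap A_t^{p+1}) = \mathrm{dens}_Q(D_s^p \cap B_s^p \cap A_s^p)$.

Using (9.51) and (9.52) and invoking the definition of a good set described in
condition (C5), we conclude that $\Theta \in \mathcal{G}_s$ if and only if
$Q \in \mathcal{G}_s^{\ell,p}$. This is equivalent to saying that
$\mathcal{G}_s = \mathcal{B}_s$. □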

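We are ready to complete the proof for the case "$\ell \in [p]$". Indeed, we
have already pointed out that the map
$\mathcal{Q}_s^{\ell,p} \ni Q \mapsto C_t^{s,\ell,Q} \in \mathcal{Q}_s$ is a
bijection. Therefore, using Subclaim 9.11 and our inductive assumptions, we see
that

(9.53) $\frac{|\mathcal{G}_t^{\ell,p+1}|}{|\mathcal{Q}_t^{\ell,p+1}|} = \sum_{s \in \Omega_t} \frac{|\mathcal{G}_s|}{|\mathcal{Q}_s|} \cdot \frac{|\mathcal{Q}_s|}{|\mathcal{Q}_t^{\ell,p+1}|} = \sum_{s \in \Omega_t} \frac{|\mathcal{B}_s|}{|\mathcal{Q}_s|} \cdot \frac{|\mathcal{Q}_s|}{|\mathcal{Q}_t^{\ell,p+1}|}$
       $= \sum_{s \in \Omega_t} \frac{|\mathcal{G}_s^{\ell,p}|}{|\mathcal{Q}_s^{\ell,p}|} \cdot \frac{|\mathcal{Q}_s|}{|\mathcal{Q}_t^{\ell,p+1}|} < \frac{\gamma^3}{256} \Big( \sum_{s \in \Omega_t} \frac{|\mathcal{Q}_s|}{|\mathcal{Q}_t^{\ell,p+1}|} \Big) = \frac{\gamma^3}{256}$.

Now we treat the case $\ell = p+1$. The argument is similar. Specifically, for
every $U \in \mathcal{V}_t$ let

(9.54) $\mathcal{Q}_U = \{Q_t^{\mathbf{x},U} : \mathbf{x} \in \Gamma_U\}$ and $\mathcal{G}_U = \mathcal{G}_t^{p+1,p+1} \cap \mathcal{Q}_U$.

By (9.39), the family $\{\mathcal{Q}_U : U \in \mathcal{V}_t\}$ is a partition
of $\mathcal{Q}_t^{p+1,p+1}$ and the family
$\{\mathcal{G}_U : U \in \mathcal{V}_t\}$ is the induced partition of
$\mathcal{G}_t^{p+1,p+1}$. Also, for every $U \in \mathcal{V}_t$ let

(9.55) $\mathcal{B}_U = \{Q_t^{\mathbf{x},U} : \mathbf{x} \in G_{i_0,U}\}$.

Recall that $G_{i_0,U}$ is the set of all $\mathbf{x} \in \Gamma_U$ satisfying
properties (P1) and (P2). Moreover, by the choice of $i_0$ in Step 1, we have
that $\mathrm{dens}_{\Gamma_U}(G_{i_0,U}) < \gamma^3/256$. We have the
following analogue of Subclaim 9.11.

Subclaim 9.12. For every $U \in \mathcal{V}_t$ we have
$\mathcal{G}_U = \mathcal{B}_U$.

Proof of Subclaim 9.12. Fix $U \in \mathcal{V}_t$ and let
$\Theta \in \mathcal{Q}_U$ be arbitrary. By (9.38), the map
$\Gamma_U \ni \mathbf{x} \mapsto Q_t^{\mathbf{x},U} \in \mathcal{Q}_U$ is a
bijection. Hence, there exists a unique $\mathbf{x} \in \Gamma_U$ such that
$Q_t^{\mathbf{x},U} = \Theta$. Invoking (9.38) once again,
$\{t\} \times Q_t^{\mathbf{x},U} = \mathrm{q}_{p+1}^{-1}(U(i_0) \times \{\mathbf{x}\})$.
Moreover, by Fact 9.5, we have $U(i_0) \subseteq \Omega_t$. By the previous
remarks and Corollary 6.10 and arguing precisely as in the proof of
Subclaim 9.11, we see that $\Theta \in \mathcal{G}_U$ if and only if
$\mathbf{x} \in G_{i_0,U}$. □

With Subclaim 9.12 at our disposal, we are ready to complete the proof for the
case "$\ell = p+1$". We have already pointed out that the map
$\Gamma_U \ni \mathbf{x} \mapsto Q_t^{\mathbf{x},U} \in \mathcal{Q}_U$ is a
bijection. Therefore, using the choice of $i_0$, we conclude that

(9.56) $\frac{|\mathcal{G}_t^{p+1,p+1}|}{|\mathcal{Q}_t^{p+1,p+1}|} = \sum_{U \in \mathcal{V}_t} \frac{|\mathcal{G}_U|}{|\mathcal{Q}_U|} \cdot \frac{|\mathcal{Q}_U|}{|\mathcal{Q}_t^{p+1,p+1}|} = \sum_{U \in \mathcal{V}_t} \frac{|G_{i_0,U}|}{|\Gamma_U|} \cdot \frac{|\mathcal{Q}_U|}{|\mathcal{Q}_t^{p+1,p+1}|}$
       $< \frac{\gamma^3}{256} \Big( \sum_{U \in \mathcal{V}_t} \frac{|\mathcal{Q}_U|}{|\mathcal{Q}_t^{p+1,p+1}|} \Big) = \frac{\gamma^3}{256}$.

The proof of Claim 9.10 is completed. □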

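Claim 9.13. For every $\ell \in [p+1]$ and every $t \in [k+1]^{<|L_{p+1}|}$ we
have that
$\mathrm{dens}_{X_{\mathbf{L}_{p+1}}}(\bigcup\mathcal{Q}_t^{\ell,p+1}) \geq \alpha$.
That is, condition (C6) is satisfied.

Proof. Let $t \in [k+1]^{<|L_{p+1}|}$ be arbitrary. Assume, first, that
$\ell \in [p]$. By the inductive assumptions, we have

(9.57) $\mathrm{dens}_{\Omega_t \times X_{\mathbf{L}_p}}\Big( \bigcup_{s \in \Omega_t} \{s\} \times \bigcup\mathcal{Q}_s^{\ell,p} \Big) \geq \alpha$.

On the other hand, by (9.41) and (9.42),

(9.58) $\mathrm{q}_{p+1}^{-1}\Big( \bigcup_{s \in \Omega_t} \{s\} \times \bigcup\mathcal{Q}_s^{\ell,p} \Big) = \mathrm{q}_{p+1}^{-1}\Big( \bigcup_{s \in \Omega_t} \bigcup_{Q \in \mathcal{Q}_s^{\ell,p}} \{s\} \times Q \Big)$
       $= \bigcup_{s \in \Omega_t} \bigcup_{Q \in \mathcal{Q}_s^{\ell,p}} \mathrm{q}_{p+1}^{-1}(\{s\} \times Q)$
       $= \bigcup_{s \in \Omega_t} \bigcup_{Q \in \mathcal{Q}_s^{\ell,p}} \{t\} \times C_t^{s,\ell,Q}$
       $= \{t\} \times \bigcup\mathcal{Q}_t^{\ell,p+1}$.

By (9.57), (9.58) and Lemma 6.6, we conclude that

(9.59) $\mathrm{dens}_{X_{\mathbf{L}_{p+1}}}(\bigcup\mathcal{Q}_t^{\ell,p+1}) = \mathrm{dens}_{\{t\} \times X_{\mathbf{L}_{p+1}}}(\{t\} \times \bigcup\mathcal{Q}_t^{\ell,p+1}) \geq \alpha$.

Next assume that $\ell = p+1$. By Fact 9.5, the family
$\{U(i_0) : U \in \mathcal{V}_t\}$ forms a partition of the set $\Omega_t$.
Moreover, as we have already pointed out immediately after (9.25), we have
$\mathrm{dens}_{X_{\mathbf{L}_p}}(\Gamma_U) \geq \alpha$. Hence,

(9.60) $\mathrm{dens}_{\Omega_t \times X_{\mathbf{L}_p}}\Big( \bigcup_{U \in \mathcal{V}_t} U(i_0) \times \Gamma_U \Big) \geq \alpha$.

Notice that, by (9.38) and (9.39),

(9.61) $\mathrm{q}_{p+1}^{-1}\Big( \bigcup_{U \in \mathcal{V}_t} U(i_0) \times \Gamma_U \Big) = \mathrm{q}_{p+1}^{-1}\Big( \bigcup_{U \in \mathcal{V}_t} \bigcup_{\mathbf{x} \in \Gamma_U} U(i_0) \times \{\mathbf{x}\} \Big)$
       $= \bigcup_{U \in \mathcal{V}_t} \bigcup_{\mathbf{x} \in \Gamma_U} \mathrm{q}_{p+1}^{-1}(U(i_0) \times \{\mathbf{x}\})$
       $= \bigcup_{U \in \mathcal{V}_t} \bigcup_{\mathbf{x} \in \Gamma_U} \{t\} \times Q_t^{\mathbf{x},U}$
       $= \{t\} \times \bigcup\mathcal{Q}_t^{p+1,p+1}$.

Combining (9.60), (9.61) and applying Lemma 6.6 we obtain

(9.62) $\mathrm{dens}_{X_{\mathbf{L}_{p+1}}}(\bigcup\mathcal{Q}_t^{p+1,p+1}) = \mathrm{dens}_{\{t\} \times X_{\mathbf{L}_{p+1}}}(\{t\} \times \bigcup\mathcal{Q}_t^{p+1,p+1}) \geq \alpha$

and the proof is completed. □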

Claim 9.14. For every $\ell \in [p+1]$ and every
$t, t' \in [k+1]^{<|L_{p+1}|}$ if $t$ and $t'$ are $(r, k+1)$-equivalent, then
$\mathcal{Q}_t^{\ell,p+1} = \mathcal{Q}_{t'}^{\ell,p+1}$. That is, condition
(C7) is satisfied.

Proof. We fix $t, t' \in [k+1]^{<|L_{p+1}|}$ which are $(r, k+1)$-equivalent.
For every $s \in \Omega_t$ let
$Y_s^t = \{x \in X_{L_{p+1}} : \mathrm{c}_{L_{p+1},V_{p+1}}(t, x) = s\}$.
Respectively, for every $s' \in \Omega_{t'}$ let
$Y_{s'}^{t'} = \{x \in X_{L_{p+1}} : \mathrm{c}_{L_{p+1},V_{p+1}}(t', x) = s'\}$.
Let $g_{t,t'} : \Omega_t \to \Omega_{t'}$ be the bijection obtained by Fact 5.4
and recall that for every $s \in \Omega_t$ we have

(9.63) $Y_s^t = Y_{g_{t,t'}(s)}^{t'}$.

Since $t$ and $t'$ are $(r, k+1)$-equivalent, by Fact 5.4 again, we see that
$s$ and $g_{t,t'}(s)$ are also $(r, k+1)$-equivalent for every
$s \in \Omega_t$.

After this preliminary discussion, we are ready for the main argument. First 
assume that £ £ [p]. For every s £ fit and every s' £ f^/ let 

(9.64) Q* = {C^' Q : Q £ Qi< p } and Q*', = {C t s /' AQ ' : Q' £ Q e f}. 

By ([9421 . the families {Q* : s £ fi t } and {Q*' : s' £ fl t '} form partitions of Qj' p+1 
and Q l t ', p+1 respectively. By (I9.40|) . for every s G fit and every Q £ Q e s tP we have 

(9.65) C s t AQ = Qx F s *. 

Of course, we have the same equality for t! ', that is, for every s' £ f2 t / and every 
Q' £ Qff it holds that 

(9.66) C$ AQ ' = Q' xY$. 



Moreover, invoking our inductive assumptions, for every s £ fit we have 

"9t,t'( s )' 



(9.67) Q'/ = Ql' P , 

Hence,

(9.68) Q* s = {Ct £ ' Q : Q 6 Q e f} ^ {Q x : Q e 



i9763l 



9t,t'( s ) 



rn„vf .^^o^ l™r>f 



Since <?t.e is a bijection we conclude that Q t — Q t , 

Before we proceed we need to introduce some terminology. Specifically, let $U$ and $U'$ be two Carlson-Simpson trees of the same dimension and consider the canonical isomorphism $\mathrm{I}_{U,U'}$ associated to the pair $U, U'$ described in §2.5. We say that $U$ and $U'$ are $(r, k+1)$-equivalent if for every $s \in U$ the words $s$ and $\mathrm{I}_{U,U'}(s)$ are $(r, k+1)$-equivalent.

Now assume that $\ell = p + 1$ and let $K_t$ and $K_{t'}$ be as in (9.33). Our first goal is to define a map $h : K_t \to K_{t'}$ with the following properties.

(a) The map $h$ is a bijection.

(b) The map $h$ preserves the lexicographical order.

(c) For every $s \in K_t$ we have that $s$ and $h(s)$ are $(r, k+1)$-equivalent.

If t = 0, then ft is the identity. Assume that |i| = |f'| = j for some j ^ 1 and recall 
that t* and are the initial segments of t and t' respectively of length j — 1. Let 
fl 1 *,,*' : ~ > Of be the map obtained by Fact 15.41 We define 

(9.69) ft(r>no- - 1)) = r x ( 9u , K (w)rt'(j - 1) 

for every w 6 f2 ti . With this choice the aforementioned properties of ft follow 
readily from the properties of <?t*,t' • 

The map ft induces a function / : Vt — > T^e defined by the rule 

(9-70) f(l(C s ))=I(C h(s) ). 

We isolate, for future use, the following properties of /. Their verification is 
straightforward. 

(d) The function / is a bijection. 

(e) For every U, U' € V% if U(0) <i cx U'(0), then f(U)(0) < lcx f(U')(0). 

(f) For every U e ?t we have that U and f(U) are (r, k + 1) -equivalent. 

Also observe that for every U £ Vt we have 
(9.71) T u = T f{u) . 

This follows by Fact 19.31 and property (f) above. More important, however, is the 
relation of the function / with the map gt,t>- Specifically, for every U &Vt we have 

(9-72) {gt,t'(s):seU(i )} = f(U)(i ). 

To see this notice, first, that for every s € K t the set C s (io) is an interval, in the 
lexicographical order, of [k + l} 1 for some I S {0, dim(14>+i)} depending only on 
the length of t (precisely, I = io + (io + l)|t|). Hence, by (|9.34|) . for every U &Vt the 
set U(io) is an interval of V p +i(l) for the same I. Therefore, equality (|9.71j) follows 
by Fact 19.51 and property (e) isolated above. 
For every U G Vt and every U' € Vp we set 

(9.73) Q\j = {Q*' U ixeTp} and = {Q*!' u> : x' G ]». 

By lpQ9]l . the families {Q\, : U & V t ] and {fi£ 7 : £/' G TV} form partitions of 
gp+i.p+i and qp+i.p+1 respective i y . B y (EOTl) . for every U eV t and £7' G V V and 
every x G Tjj and x' G Ty/ we have 

(9.74) Q*' u = {x}x |J Y* and Q^' = {x'} x (J F//. 

s6f7(i ) s'eC/'(io) 

Thus, for every U GVt, 



(9.75) fit ^ {gr t/ :xEr c/ } E P{{x}x |J F< : x G F,,} 

seU(i ) 

{{x'}x |J r s *:x'er /(f7) } 

se(7(i ) 
se(7(io) 

{{x'}x y y;:x'er /(f7) } = Q5 



I9TH 
nil 



*'e/(tO(*o) 



Since / is a bijection we conclude that fiP +1,p+1 = fiP+ > p + . The proof of Claim 
19.141 is thus completed. □ 

By Claims [5771 up to 19.141 the pair (V p+ i, L p+ i) and the families Q^' p+1 con- 
structed in Steps 1, 2 and 3, satisfy all required conditions. This completes the 
recursive selection, and as we have already indicated, the proof of Lemma 19.11 is 
also completed. 

9.4. Consequences. In this subsection we will isolate what we get by iterating Lemma 9.1. The resulting statement together with Corollary 8.6 forms the basis of the proof of Theorem B. We proceed to the details.

Let $k \in \mathbb{N}$ with $k \ge 2$ and assume that for every integer $l \ge 1$ and every $0 < \beta \le 1$ the number $\mathrm{DCS}(k, l, \beta)$ has been defined. We define $H : \mathbb{N} \times (0,1] \to \mathbb{N}$ by $H(0, \gamma) = 0$ and

(9.76) $H(m, \gamma) = \mathrm{Reg}(k+1, N_{p_0} + 1, 2, \gamma^2/2)$

if $m \ge 1$, where $p_0$ and $N_{p_0}$ are defined in (9.6) and (9.7) respectively for the parameters $m$ and $\gamma$. Next for every $n \in \{0, \dots, k\}$ we define $H^{(n)} : \mathbb{N} \times (0,1] \to \mathbb{N}$ recursively by the rule $H^{(0)}(m, \gamma) = m$ and

(9.77) $H^{(n+1)}(m, \gamma) = H(H^{(n)}(m, \gamma), \gamma)$.

Finally, for every $0 < \gamma \le 1$ let

(9-78) g = g(7)= ; ; 7 , 3 M - 

(2V2.32) 3 

We have the following corollary. It is an immediate consequence of Lemma 9.1.



Corollary 9.15. Let $k \in \mathbb{N}$ with $k \ge 2$ and assume that for every integer $l \ge 1$ and every $0 < \beta \le 1$ the number $\mathrm{DCS}(k, l, \beta)$ has been defined.

Let $0 < \gamma \le 1$. Also let $V$ be a Carlson-Simpson tree of $[k+1]^{<\mathbb{N}}$ and $I$ be a nonempty subset of $\{0, \dots, \dim(V)\}$. Assume that we are given subsets $A, D_1, \dots, D_k$ of $[k+1]^{<\mathbb{N}}$ with the following properties.

(a) For every $r \in [k]$ the set $D_r$ is $(r, k+1)$-insensitive in $V$.

(b) We have $\mathrm{dens}_{V(i)}(D_1 \cap \dots \cap D_k \cap A) \ge (\delta + \gamma)\,\mathrm{dens}_{V(i)}(D_1 \cap \dots \cap D_k)$ and $\mathrm{dens}_{V(i)}(D_1 \cap \dots \cap D_k) \ge \gamma$ for every $i \in I$.

Finally, let $m \in \mathbb{N}$ with $m \ge 1$ and suppose that

(9.79) |/|>JT( fc >(m,0 

where and £ are defined in (|9.77|) and (|9.78[) respectively for the parameters 

$m$ and $\gamma$. Then there exist a Carlson-Simpson subtree $W$ of $V$ and a subset $I'$ of $\{0, \dots, \dim(W)\}$ of cardinality $m$ such that

(9.80) dens w(i) (A) > <5 + £ 
for every $i \in I'$.

10. Proof of Theorem B 

In this section we will complete the proof of Theorem B following the inductive scheme outlined in §8.1. Notice first that the numbers $\mathrm{DCS}(2, 1, \delta)$ are estimated in Proposition 7.1. It is then easy to see that, by induction on $m$ and Corollary 7.6, we may also estimate the numbers $\mathrm{DCS}(2, m, \delta)$.

Now we argue for the general inductive step. So let $k \in \mathbb{N}$ with $k \ge 2$ and assume that for every integer $l \ge 1$ and every $0 < \beta \le 1$ the number $\mathrm{DCS}(k, l, \beta)$ has been
defined. We fix < S «S 1. Let 771 be as in ([Q3| . Recall that 

(10.1) 77! 



120fc- |Subtn([/c]< A 
where A = [8(5- 1 DCS(fc, 1, 6/8)] . We set 



(2V2.32) 3 

and we define F$ : N — > N by the rule 

(10.3) F s (m) = G 1 {\r 1 ^{k + l)k-H^ k \m,g)\, r£/2) 

where $G_1$ and $H^{(k)}(m, \varrho)$ are as in (8.34) and (9.77) respectively. The following proposition is the heart of the density increment strategy. It follows immediately by Corollary 8.6 and Corollary 9.15.

Proposition 10.1. Let $k \in \mathbb{N}$ with $k \ge 2$ and assume that for every integer $l \ge 1$ and every $0 < \beta \le 1$ the number $\mathrm{DCS}(k, l, \beta)$ has been defined.

Let $0 < \delta \le 1$ and $L$ be a nonempty finite subset of $\mathbb{N}$. Also let $A \subseteq [k+1]^{<\mathbb{N}}$ such that $|A \cap [k+1]^l| \ge \delta (k+1)^l$ for every $l \in L$ and assume that $A$ contains no Carlson-Simpson line of $[k+1]^{<\mathbb{N}}$. Finally, let $m \in \mathbb{N}$ with $m \ge 1$ and suppose that $|L| \ge F_\delta(m)$ where $F_\delta$ is as in (10.3). Then there exist a Carlson-Simpson tree $W$ of $[k+1]^{<\mathbb{N}}$ and a subset $I$ of $\{0, \dots, \dim(W)\}$ of cardinality $m$ such that $\mathrm{dens}_{W(i)}(A) \ge \delta + \varrho$ for every $i \in I$ where $\varrho$ is as in (10.2).

Using Proposition 10.1 the numbers $\mathrm{DCS}(k+1, 1, \delta)$ can, of course, be estimated easily. In particular, we have the following corollary.

Corollary 10.2. Let $k \in \mathbb{N}$ with $k \ge 2$ and assume that for every integer $l \ge 1$ and every $0 < \beta \le 1$ the number $\mathrm{DCS}(k, l, \beta)$ has been defined. Then for every $0 < \delta \le 1$ we have

(10.4) $\mathrm{DCS}(k+1, 1, \delta) \le F_\delta^{(\lceil \varrho^{-1} \rceil)}(1)$.

Finally, just as in the case "$k = 2$", the numbers $\mathrm{DCS}(k+1, m, \delta)$ can be estimated using Corollary 10.2 and Corollary 7.6. This completes the proof of the general inductive step, and so, the entire proof of Theorem B is completed.

11. Consequences 

Our goal in this section is to prove several consequences of Theorem B. These include Theorem A and Theorem C stated in the introduction, as well as an appropriate finite version of Theorem C. To state this finite version we first need to introduce some terminology.

Recall that, given two sequences $(p_n)$ and $(w_n)$ of variable words over $k$, we say that $(w_n)$ is of pattern $(p_n)$ if $p_n$ is an initial segment of $w_n$ for every $n \in \mathbb{N}$. This notion can, of course, be extended to finite sequences of the same length. Specifically, given two finite sequences $(p_n)_{n=0}^{m-1}$ and $(w_n)_{n=0}^{m-1}$ of variable words over $k$, we say that $(w_n)_{n=0}^{m-1}$ is of pattern $(p_n)_{n=0}^{m-1}$ if $p_n$ is an initial segment of $w_n$ for every $n \in \{0, \dots, m-1\}$. In particular, if $p$ and $w$ are variable words over $k$, then $w$ is of pattern $p$ if $p$ is an initial segment of $w$. We have the following theorem.

Theorem 11.1. For every integer $k \ge 2$, every nonempty finite sequence $(\tau_n)_{n=0}^{m-1}$ of positive integers and every $0 < \delta \le 1$ there exists an integer $N$ with the following property. If $(p_n)_{n=0}^{m-1}$ is a finite sequence of variable words over $k$ such that the length of $p_n$ is $\tau_n$ for every $n \in \{0, \dots, m-1\}$, $L$ is a finite subset of $\mathbb{N}$ of cardinality at least $N$ and $A$ is a subset of $[k]^{<\mathbb{N}}$ satisfying $\mathrm{dens}_{[k]^\ell}(A) \ge \delta$ for every $\ell \in L$, then there exist a word $c$ over $k$ and a finite sequence $(w_n)_{n=0}^{m-1}$ of variable words over $k$ of pattern $(p_n)_{n=0}^{m-1}$ such that the set

(11.1) $\{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \{0, \dots, m-1\} \text{ and } a_0, \dots, a_n \in [k]\}$

is contained in $A$. The least integer $N$ with the above property will be denoted by $\mathrm{DP}(k, (\tau_n)_{n=0}^{m-1}, \delta)$.

The proof of Theorem 11.1 will be given in §11.4. All necessary tools (besides, of course, Theorem B) are developed in the previous subsections. The corresponding infinite versions, that is, Theorem A and Theorem C, will be proved in §11.5. Finally, in §11.6 we discuss how one can derive the density Hales-Jewett Theorem and the density Halpern-Läuchli Theorem from Theorem B and Theorem A respectively.

11.1. Sparse sets and regularity. We begin with the following definition. 

Definition 11.2. Let $\tau \in \mathbb{N}$ with $\tau \ge 1$ and $L$ be a nonempty subset of $\mathbb{N}$. We say that $L$ is $\tau$-sparse if for every $l, l' \in L$ with $l \ne l'$ we have $|l' - l| \ge \tau$. If $L$ is a $\tau$-sparse subset of $\mathbb{N}$, then we define the $\tau$-extension of $L$ to be the set

(11.2) $(L)_\tau = L + \{0, \dots, \tau - 1\} = \{l + n : l \in L \text{ and } 0 \le n \le \tau - 1\}$.

Every nonempty subset of $\mathbb{N}$ is 1-sparse and coincides with its 1-extension. Also notice that every subset of $\mathbb{N}$ of cardinality at least $\tau(m-1) + 1$ contains a $\tau$-sparse subset of cardinality $m$. Finally observe that the notion of $\tau$-sparseness is hereditary, that is, every nonempty subset of a $\tau$-sparse set is $\tau$-sparse.
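
The remarks on sparse sets can be checked mechanically. The Python sketch below is an illustration only; the function names are hypothetical, and the greedy selection is one possible way (not necessarily the authors') to see that a set with at least τ(m − 1) + 1 elements contains a τ-sparse subset of cardinality m.

```python
def is_sparse(L, tau):
    """True if L is nonempty and any two distinct elements differ by at least tau."""
    s = sorted(set(L))
    return len(s) > 0 and all(b - a >= tau for a, b in zip(s, s[1:]))


def extension(L, tau):
    """The tau-extension (L)_tau = L + {0, ..., tau - 1}."""
    return {l + n for l in L for n in range(tau)}


def sparse_subset(L, tau, m):
    """Greedily pick a tau-sparse subset of L with m elements, or None if the
    greedy scan runs out of elements first."""
    chosen = []
    for l in sorted(set(L)):
        # keep l if it is the first element or far enough from the last pick
        if not chosen or l - chosen[-1] >= tau:
            chosen.append(l)
        if len(chosen) == m:
            return chosen
    return None
```

The greedy scan always succeeds when the input has at least τ(m − 1) + 1 elements, because the next pick is never more than τ positions further along the sorted list.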

Much of our interest in sparse sets is related to the following generalized version of Definition 3.1.

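Definition 11.3. Let $k \in \mathbb{N}$ with $k \ge 2$ and $\mathcal{F}$ be a family of subsets of $[k]^{<\mathbb{N}}$. Also let $0 < \varepsilon \le 1$, $\tau \in \mathbb{N}$ with $\tau \ge 1$ and $L$ be a $\tau$-sparse finite subset of $\mathbb{N}$. The family $\mathcal{F}$ will be called $(\varepsilon, \tau, L)$-regular if for every $n \in L$, every $I \subseteq \{l \in L : l < n\}$, every $y \in [k]^{(I)_\tau}$ and every $A \in \mathcal{F}$ we have

(11.3) $|\mathrm{dens}(\{z \in [k]^{\{m \in \mathbb{N} : m < n\} \setminus (I)_\tau} : (y, z) \in A \cap [k]^n\}) - \mathrm{dens}(A \cap [k]^n)| \le \varepsilon$.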

We have the following analogue of Lemma 3.2. It is the main result of this subsection.

Lemma 11.4. Let $0 < \varepsilon \le 1$ and $k, \ell, q, \tau \in \mathbb{N}$ with $k \ge 2$ and $\ell, q, \tau \ge 1$. Then there exists an integer $n$ with the following property. If $N$ is a finite $\tau$-sparse subset of $\mathbb{N}$ with $|N| \ge n$ and $\mathcal{F}$ is a family of subsets of $[k]^{<\mathbb{N}}$ with $|\mathcal{F}| = q$, then there exists a subset $L$ of $N$ with $|L| = \ell$ such that $\mathcal{F}$ is $(\varepsilon, \tau, L)$-regular. The least integer $n$ with this property will be denoted by $\mathrm{Reg}_\tau(k, \ell, q, \varepsilon)$.

The proof of Lemma 11.4 is similar to the proof of Lemma 3.2 and is based on the following consequence of Sublemma 3.7.

Corollary 11.5. Let $k, m, \tau, q \in \mathbb{N}$ with $k \ge 2$ and $\tau, q \ge 1$. Let $0 < \varepsilon \le k^{-\tau(m+1)}$ and $N$ be a finite $\tau$-sparse subset of $\mathbb{N}$ with

(11.4) $|N| \ge (q\lfloor 16\varepsilon^{-4} \rfloor + 1)(m+1) + 1$.

Finally, let $\mathcal{F}$ be a family of subsets of $[k]^{\max(N)}$ with $|\mathcal{F}| = q$. Then there exists a subinterval $M$ of $N \setminus \{\max(N)\}$ with $|M| = m$ such that for every $A \in \mathcal{F}$, every subset $I$ of $(M)_\tau$ and every $y \in [k]^I$ we have $|\mathrm{dens}(A_y) - \mathrm{dens}(A)| \le \varepsilon$.

Proof. We set $N' = N \setminus \{\max(N)\}$. Observe that

(11.5) $|(N')_\tau \cup \{\max(N)\}| = \tau(|N| - 1) + 1 \ge \tau(m+1)(q\lfloor 16\varepsilon^{-4} \rfloor + 1) + 1$.

By Sublemma 3.7, there exists a subinterval $M'$ of $(N')_\tau$ with $|M'| = \tau(m+1)$ such that for every $A \in \mathcal{F}$, every subset $I$ of $M'$ and every $y \in [k]^I$ we have

(11.6) $|\mathrm{dens}(A_y) - \mathrm{dens}(A)| \le \varepsilon$.

Since $M'$ is a subinterval of $(N')_\tau$ of cardinality $\tau(m+1)$, it is possible to select a subinterval $M$ of $N'$ of cardinality $m$ such that $(M)_\tau \subseteq M'$. Clearly $M$ is as desired. □

We are ready to proceed to the proof of Lemma 11.4.

Proof of Lemma 11.4. We set $\varrho = \min\{\varepsilon, k^{-\tau(\ell+1)}/2\}$ and we define $F : \mathbb{N} \to \mathbb{N}$ by

(11.7) $F(m) = (q\lfloor 16\varrho^{-4} \rfloor + 1)(m+1) + 1$.

Arguing precisely as in the proof of Lemma 3.2 and using Corollary 11.5 instead of Sublemma 3.7 we see that

(11.8) $\mathrm{Reg}_\tau(k, \ell, q, \varepsilon) \le F^{(\ell)}(0)$

and the proof is completed. □
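
To get a feeling for how fast the bound (11.8) grows, the following Python sketch evaluates the iterate of F at 0 exactly, using rational arithmetic. It is an illustration only: the function name `reg_tau_upper_bound` and its argument order are assumptions of this sketch, and the formulas for the auxiliary quantity and for F are read off from (11.7).

```python
from fractions import Fraction
from math import floor


def reg_tau_upper_bound(k, ell, q, eps, tau):
    """Evaluate F^(ell)(0), where F(m) = (q*floor(16*rho^-4) + 1)*(m + 1) + 1
    and rho = min(eps, k^(-tau*(ell+1)) / 2), as in (11.7) and (11.8)."""
    rho = min(Fraction(eps), Fraction(1, 2 * k ** (tau * (ell + 1))))
    step = q * floor(16 / rho ** 4) + 1  # the factor multiplying (m + 1)
    value = 0
    for _ in range(ell):                 # iterate F exactly ell times
        value = step * (value + 1) + 1
    return value
```

Even for k = 2, τ = 1, q = 1 and ε = 1/2 the value already exceeds 65000 for a single iteration, which illustrates why the quantitative bounds in this section are far from practical.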

11.2. The $(p, L)$-restriction of $[k]^{<\mathbb{N}}$. We are about to introduce a family of subsets of $[k]^{<\mathbb{N}}$ which are the analogues of Carlson-Simpson trees in the context of variable words of a fixed pattern $p$.

Definition 11.6. Let $k, \tau \in \mathbb{N}$ with $k \ge 2$ and $\tau \ge 1$. Let $p$ be a variable word over $k$ of length $\tau$ and $L = \{l_0 < \dots < l_{|L|-1}\}$ a $\tau$-sparse finite subset of $\mathbb{N}$. Recursively, for every $i \in \{0, \dots, |L| - 1\}$ we define a subset $R_{p,L}(i)$ of $[k]^{l_i}$ as follows. We set $R_{p,L}(0) = [k]^{l_0}$. Assume that $R_{p,L}(i)$ has been defined for some $i \in \{0, \dots, |L| - 2\}$. Then we set

(11.9) $R_{p,L}(i+1) = \{x^\frown p(a)^\frown y : x \in R_{p,L}(i),\ a \in [k] \text{ and } y \in [k]^{l_{i+1} - l_i - \tau}\}$.

We define the $(p, L)$-restriction of $[k]^{<\mathbb{N}}$ to be the set

(11.10) $R_{p,L} = \bigcup_{i=0}^{|L|-1} R_{p,L}(i)$.
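
The recursive definition of the levels of the restriction can be turned into a small enumeration routine. The Python sketch below is a hypothetical illustration (tuples over {1, ..., k} for words, the string "v" for the variable, and the function names are all assumptions of the sketch); it is only feasible for very small parameters.

```python
from itertools import product

VAR = "v"  # stand-in for the variable letter v (an assumption of this sketch)


def substitute(p, a):
    """Return p(a): every occurrence of the variable in p is replaced by a."""
    return tuple(a if x == VAR else x for x in p)


def restriction_levels(p, L, k):
    """Return the list [R(0), R(1), ...] of levels of the (p, L)-restriction,
    following the recursion R(i+1) = {x^p(a)^y}."""
    tau = len(p)
    L = sorted(L)
    assert all(b - a >= tau for a, b in zip(L, L[1:])), "L must be tau-sparse"
    letters = range(1, k + 1)
    levels = [set(product(letters, repeat=L[0]))]   # R(0) = [k]^{l_0}
    for i in range(len(L) - 1):
        gap = L[i + 1] - L[i] - tau                 # length of the free block y
        levels.append({
            x + substitute(p, a) + y
            for x in levels[i]
            for a in letters
            for y in product(letters, repeat=gap)
        })
    return levels
```

With k = 2, p = (v, 1) and L = {1, 4}, the second level has 8 of the 16 words of length 4, which makes the "thinness" of the restriction visible already in a toy case.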






Notice that the $(p, L)$-restriction $R_{p,L}$ is a rather "thin" subset of $[k]^{<\mathbb{N}}$. So, if we are given a subset $A$ of $[k]^{<\mathbb{N}}$ it is likely that the density of $A$ inside $R_{p,L}$ will be negligible. This phenomenon, however, does not occur as long as $A$ is sufficiently regular. In particular, we have the following lemma. Its proof is a straightforward consequence of the relevant definitions.

Lemma 11.7. Let $k, \tau \in \mathbb{N}$ with $k \ge 2$ and $\tau \ge 1$, $p$ be a variable word over $k$ of length $\tau$ and $L = \{l_0 < \dots < l_{|L|-1}\}$ a $\tau$-sparse finite subset of $\mathbb{N}$. Let $0 < \varepsilon \le 1$ and $\mathcal{F}$ be a family of subsets of $[k]^{<\mathbb{N}}$ which is $(\varepsilon, \tau, L)$-regular. Then

(11.11) $|\mathrm{dens}_{R_{p,L}(i)}(A) - \mathrm{dens}_{[k]^{l_i}}(A)| \le \varepsilon$

for every $i \in \{0, \dots, |L| - 1\}$ and every $A \in \mathcal{F}$.

We will need to parameterize the $(p, L)$-restriction $R_{p,L}$ of $[k]^{<\mathbb{N}}$ in a "canonical" way. This is, essentially, the content of the following definition.

Definition 11.8. Let $k, \tau \in \mathbb{N}$ with $k \ge 2$ and $\tau \ge 1$. Let $p$ be a variable word over $k$ of length $\tau$ and $L = \{l_0 < \dots < l_{|L|-1}\}$ a $\tau$-sparse finite subset of $\mathbb{N}$. For every $i \in \{0, \dots, |L| - 1\}$ we set $l'_i = l_i - i\tau + i$ and we define

(11.12) $L^{(\tau)} = \{l'_i : i = 0, \dots, |L| - 1\}$.

Recursively we define a bijection

(11.13) $\Phi_{p,L} : \bigcup_{i=0}^{|L|-1} [k]^{l'_i} \to R_{p,L}$

as follows. For every $x \in [k]^{l'_0} = [k]^{l_0}$ we set $\Phi_{p,L}(x) = x$. Let $i \in \{0, \dots, |L| - 2\}$ and assume that $\Phi_{p,L}(y)$ has been defined for every $y \in [k]^{l'_i}$. Then for every $x \in [k]^{l'_{i+1}}$ we define

(11.14) $\Phi_{p,L}(x) = \Phi_{p,L}(x_1)^\frown p(a_x)^\frown x_2$

where $x_1 = (x(0), \dots, x(l'_i - 1))$, $a_x = x(l'_i)$ and $x_2 = (x(l'_i + 1), \dots, x(l'_{i+1} - 1))$.
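
The recursion (11.14) unrolls into a simple loop: the letter of x sitting at each position $l'_j$ is expanded to the block $p(a)$, and everything else is copied. The Python sketch below follows this reading; the representation of words as tuples, the string "v" for the variable and the function names are assumptions of the sketch rather than notation from the paper.

```python
VAR = "v"  # stand-in for the variable letter v (an assumption of this sketch)


def substitute(p, a):
    """Return p(a): every occurrence of the variable in p is replaced by a."""
    return tuple(a if x == VAR else x for x in p)


def primed_levels(L, tau):
    """The integers l'_i = l_i - i*tau + i of (11.12), in increasing order."""
    return [l - i * tau + i for i, l in enumerate(sorted(L))]


def phi(x, p, L):
    """Unrolled form of (11.14) for a word x whose length lies in L^(tau)."""
    tau = len(p)
    Lp = primed_levels(L, tau)
    i = Lp.index(len(x))              # raises ValueError for other lengths
    out = tuple(x[:Lp[0]])            # the identity on the lowest level
    for j in range(i):
        a = x[Lp[j]]                  # the letter that is fed into p
        x2 = tuple(x[Lp[j] + 1:Lp[j + 1]])
        out += substitute(p, a) + x2
    return out
```

For k = 2, p = (v, 1) and L = {1, 4} we get $L^{(\tau)} = \{1, 3\}$, and the word (2, 1, 2) is sent to (2, 1, 1, 2): the middle letter 1 has been expanded to p(1) = (1, 1).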

We isolate, for future use, some elementary properties of the map $\Phi_{p,L}$.

Lemma 11.9. Let $k, \tau \in \mathbb{N}$ with $k \ge 2$ and $\tau \ge 1$. Let $p$ be a variable word over $k$ of length $\tau$ and $L = \{l_0 < \dots < l_{|L|-1}\}$ a $\tau$-sparse finite subset of $\mathbb{N}$. Let $L^{(\tau)} = \{l'_i : i = 0, \dots, |L| - 1\}$ be as in (11.12). Then the following are satisfied.

(i) For every $i \in \{0, \dots, |L| - 1\}$ we have $\Phi_{p,L}([k]^{l'_i}) = R_{p,L}(i)$.

(ii) For every Carlson-Simpson line $W$ of $[k]^{<\mathbb{N}}$ with $L(W) \subseteq L^{(\tau)}$ its image $\Phi_{p,L}(W)$ under the map $\Phi_{p,L}$ is of the form $\{c\} \cup \{c^\frown w(a) : a \in [k]\}$ where $c$ is a word over $k$ and $w$ is a variable word over $k$ of pattern $p$.

(iii) If $\mathcal{F}$ is a family of subsets of $[k]^{<\mathbb{N}}$ which is $(\varepsilon, \tau, L)$-regular for some $0 < \varepsilon \le 1$, then for every $i \in \{0, \dots, |L| - 1\}$ and every $A \in \mathcal{F}$ we have

(11.15) $|\mathrm{dens}_{[k]^{l'_i}}(\Phi_{p,L}^{-1}(A)) - \mathrm{dens}_{[k]^{l_i}}(A)| \le \varepsilon$.

Parts (i) and (ii) of Lemma 11.9 are immediate consequences of Definition 11.8. Part (iii) follows easily by Lemma 11.7. We leave the details to the reader.

11.3. Preliminary lemmas. As in (7.15) and (7.16) for every $k \in \mathbb{N}$ with $k \ge 2$ and every $0 < \delta \le 1$ we set

(11.16) $\Lambda(k, 1, \delta) = \lceil \delta^{-1} \mathrm{DCS}(k, 1, \delta) \rceil$
and
(11.17) Q(k,l,5) = 



Subtr 1 ([fc]< A ( fe . 1 ^))C 
We have the following analogue of Lemma 7.9.

Lemma 11.10. Let $k \in \mathbb{N}$ with $k \ge 2$ and $0 < \rho, \gamma \le 1$. Also let $\tau \in \mathbb{N}$ with $\tau \ge 1$ and $M$ be a finite subset of $\mathbb{N}$ with

(11.18) $|M| \ge \tau \cdot \mathrm{Reg}_\tau(k, \Lambda(k, 1, \rho\gamma/8), 1, \rho/2)$

where $\Lambda(k, 1, \rho\gamma/8)$ is as in (11.16). Finally let $B \subseteq [k]^{<\mathbb{N}}$ with $\mathrm{dens}_{[k]^n}(B) \ge \rho$ for every $n \in M$. If $\{A_t : t \in B\}$ is a family of measurable events in a probability space $(\Omega, \Sigma, \mu)$ satisfying $\mu(A_t) \ge \gamma$ for every $t \in B$, then for every variable word $p$ over $k$ of length $\tau$ there exist a word $c$ over $k$ and a variable word $w$ over $k$ of pattern $p$ such that, setting $V = \{c\} \cup \{c^\frown w(a) : a \in [k]\}$, we have $V \subseteq B$ and

(11.19) $\mu\big(\bigcap_{t \in V} A_t\big) \ge \Theta(k, 1, \rho\gamma/8)$

where $\Theta(k, 1, \rho\gamma/8)$ is as in (11.17).

Proof. By (11.18), we may select a $\tau$-sparse subset $N$ of $M$ with

(11.20) $|N| \ge \mathrm{Reg}_\tau(k, \Lambda(k, 1, \rho\gamma/8), 1, \rho/2)$.

By Lemma 11.4 there exists a subset $L$ of $N$ with $|L| = \Lambda(k, 1, \rho\gamma/8)$ such that the family $\mathcal{F} := \{B\}$ is $(\rho/2, \tau, L)$-regular.

Fix a variable word $p$ over $k$ of length $\tau$. Let $L^{(\tau)}$ and $\Phi_{p,L}$ be as in Definition 11.8. We set $B' = \Phi_{p,L}^{-1}(B)$. Since the singleton $\{B\}$ is $(\rho/2, \tau, L)$-regular and $\mathrm{dens}_{[k]^l}(B) \ge \rho$ for every $l \in L$, by part (iii) of Lemma 11.9 we get that $\mathrm{dens}_{[k]^{l'}}(B') \ge \rho/2$ for every $l' \in L^{(\tau)}$. Next for every $t' \in B'$ let $C_{t'} = A_{\Phi_{p,L}(t')}$. By our assumptions we have $\mu(A_t) \ge \gamma$ for every $t \in B$, and so, $\mu(C_{t'}) \ge \gamma$ for every $t' \in B'$. Finally notice that $|L^{(\tau)}| = |L| = \Lambda(k, 1, \rho\gamma/8)$. By the previous discussion, we may apply Lemma 7.9 and we obtain a Carlson-Simpson line $V' \subseteq B'$ with $L(V') \subseteq L^{(\tau)}$ and such that

(11.21) $\mu\big(\bigcap_{t' \in V'} C_{t'}\big) \ge \Theta(k, 1, \rho\gamma/8)$.

We set $V = \Phi_{p,L}(V')$. By part (ii) of Lemma 11.9 and (11.21), we see that $V$ is as desired. The proof is completed. □

To state the next result we need to introduce some numerical invariants. Specifically, for every $k, \tau \in \mathbb{N}$ with $k \ge 2$ and $\tau \ge 1$ and every $0 < \delta \le 1$ we set

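(11.22) $\Lambda_P = \Lambda_P(k, \delta) = \Lambda(k, 1, \delta^2/32)$ and $\Theta_P = \Theta_P(k, \delta) = \Theta(k, 1, \delta^2/32)$

and we define $h_{\tau,\delta} : \mathbb{N} \to \mathbb{N}$ by the rule

(11.23) $h_{\tau,\delta}(n) = \tau \cdot \mathrm{Reg}_\tau(k, \Lambda_P, 1, \delta/4) + \lceil 2\Theta_P^{-1} \cdot n \rceil$.

The following proposition corresponds to Proposition 7.5.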

Proposition 11.11. Let $k \in \mathbb{N}$ with $k \ge 2$ and $0 < \delta \le 1$ and define $\Lambda_P$ and $\Theta_P$ as in (11.22). Also let $\tau \in \mathbb{N}$ with $\tau \ge 1$, $L$ be a nonempty finite subset of $\mathbb{N}$ and $A$ be a subset of $[k]^{<\mathbb{N}}$ such that $\mathrm{dens}_{[k]^l}(A) \ge \delta$ for every $l \in L$. Finally let $n \in \mathbb{N}$ with $n \ge 1$ and assume that $|L| \ge h_{\tau,\delta}(n)$ where $h_{\tau,\delta}$ is as in (11.23). Then, setting $L_0$ to be the set of the first $\Lambda_P$ elements of $L$, we have that either

(i) there exist a subset $L'$ of $L \setminus L_0$ with $|L'| \ge n$ and a word $t_0 \in [k]^{\ell_0}$ for some $\ell_0 \in L_0$ such that

(11.24) $\mathrm{dens}_{[k]^{\ell - \ell_0}}(\{s \in [k]^{<\mathbb{N}} : t_0^\frown s \in A\}) \ge \delta + \delta^2/8$

for every $\ell \in L'$, or

(ii) for every variable word $p$ over $k$ of length $\tau$ there exist a word $c$ over $k$, a variable word $w$ over $k$ of pattern $p$ and a subset $L''$ of $L \setminus L_0$ with $|L''| \ge n$ such that the following are satisfied.

(a) The set $V = \{c\} \cup \{c^\frown w(a) : a \in [k]\}$ is contained in $\bigcup_{\ell \in L_0} A \cap [k]^\ell$.

(b) Setting $V(1) = \{c^\frown w(a) : a \in [k]\}$ and $\ell_1$ the unique integer with $V(1) \subseteq [k]^{\ell_1}$, for every $\ell \in L''$ we have

(11.25) $\mathrm{dens}_{[k]^{\ell - \ell_1}}(\{s \in [k]^{<\mathbb{N}} : t^\frown s \in A \text{ for every } t \in V(1)\}) \ge \Theta_P/2$.

The proof of Proposition 11.11 is identical to the proof of Proposition 7.5 using Lemma 11.10 instead of Lemma 7.9. The details are left to the reader.

We close this subsection with the following consequence of Proposition 11.11. It is the main tool for the proof of Theorem C.

Corollary 11.12. Let $k \in \mathbb{N}$ with $k \ge 2$ and $0 < \delta \le 1$. Also let $L$ be an infinite subset of $\mathbb{N}$ and $A$ be a subset of $[k]^{<\mathbb{N}}$ such that $\mathrm{dens}_{[k]^l}(A) \ge \delta$ for every $l \in L$. Then for every variable word $p$ over $k$ there exist a word $c$ over $k$, a variable word $w$ over $k$ of pattern $p$ and an infinite subset $L'$ of $L$ with the following properties.

(i) The set $V = \{c\} \cup \{c^\frown w(a) : a \in [k]\}$ is contained in $\bigcup_{\ell \in L_0} A \cap [k]^\ell$ where $L_0 = \{\ell \in L : \ell < \min(L')\}$.

(ii) Setting $V(1) = \{c^\frown w(a) : a \in [k]\}$ and $\ell_1$ the unique integer such that $V(1) \subseteq [k]^{\ell_1}$, for every $\ell \in L'$ we have

(11.26) $\mathrm{dens}_{[k]^{\ell - \ell_1}}(\{s \in [k]^{<\mathbb{N}} : t^\frown s \in A \text{ for every } t \in V(1)\}) \ge 2^{-1}\Theta_P(k, \delta/2)$

where $\Theta_P(k, \delta/2)$ is as in (11.22).

Proof. For every $t \in [k]^{<\mathbb{N}}$ let $A_t = \{s \in [k]^{<\mathbb{N}} : t^\frown s \in A\}$ and define

(11.27) $\delta_t = \limsup_{\ell \in L} \mathrm{dens}_{[k]^{\ell - |t|}}(A_t)$.

We set $\delta^* = \sup_{t \in [k]^{<\mathbb{N}}} \delta_t$ and we notice that $\delta \le \delta^* \le 1$. Hence, we may select $0 < \delta_0 \le 1$, $t_0 \in [k]^{<\mathbb{N}}$ and an infinite subset $M$ of $L$ with $\min(M) > |t_0|$ such that

(11.28) $\delta/2 < \delta_0 < \delta^* < \delta_0 + \delta_0^2/8$

and

(11.29) $\delta_0 \le \mathrm{dens}_{[k]^{\ell - |t_0|}}(A_{t_0})$

for every $\ell \in M$.

Fix a variable word $p$ over $k$ and denote by $\tau$ its length. Let $M_0$ be the initial segment of $M$ of cardinality

(11.30) $|M_0| = \tau \cdot \mathrm{Reg}_\tau(k, \Lambda_P(k, \delta_0), 1, \delta_0/4)$.

By the definition of $\delta^*$ and (11.28), there exists $q \in M \setminus M_0$ such that for every $s \in \bigcup_{\ell \in M_0} [k]^{\ell - |t_0|}$ and every $\ell \in M$ with $\ell \ge q$ we have

(11.31) $\mathrm{dens}_{[k]^{\ell - |t_0^\frown s|}}(A_{t_0^\frown s}) < \delta_0 + \delta_0^2/8$.

We set $d = \lceil 2\Theta_P(k, \delta_0)^{-1} \rceil$ and we select a sequence $(E_n)$ of pairwise disjoint subsets of $\{m \in M : m \ge q\}$ such that $|E_n| = d$ for every $n \in \mathbb{N}$.

Let $n \in \mathbb{N}$ be arbitrary. We set $F_n = M_0 \cup E_n$ and we observe that

(11.32) $|F_n| = |M_0| + |E_n| = \tau \cdot \mathrm{Reg}_\tau(k, \Lambda_P(k, \delta_0), 1, \delta_0/4) + d = h_{\tau,\delta_0}(1)$.

By (11.32), we may apply Proposition 11.11. Notice that the first alternative of Proposition 11.11 contradicts (11.31). Hence, there exist a word $c_n$ over $k$, a variable word $w_n$ over $k$ of pattern $p$ and $m_n \in E_n$ such that

(11.33) $V_n = \{c_n\} \cup \{c_n^\frown w_n(a) : a \in [k]\} \subseteq \bigcup_{\ell \in M_0} A_{t_0} \cap [k]^{\ell - |t_0|}$

and, setting $Q_n = \{s \in [k]^{<\mathbb{N}} : t^\frown s \in A_{t_0} \text{ for every } t \in V_n(1)\}$,

(11.34) $\mathrm{dens}_{[k]^{m_n - \ell_n - |t_0|}}(Q_n) \ge 2^{-1}\Theta_P(k, \delta_0) \overset{(11.28)}{\ge} 2^{-1}\Theta_P(k, \delta/2)$

where $V_n(1) = \{c_n^\frown w_n(a) : a \in [k]\}$ and $\ell_n$ is the unique integer with $V_n(1) \subseteq [k]^{\ell_n}$.

By the classical pigeonhole principle, there exist an infinite subset $N$ of $\mathbb{N}$, a word $c'$ over $k$ and a variable word $w$ over $k$ of pattern $p$ such that $c_n = c'$ and $w_n = w$ for every $n \in N$. We set $L' = \{m_n : n \in N\}$ and $c = t_0^\frown c'$. Using (11.33) and (11.34) it is easy to see that $L'$, $c$ and $w$ are as desired. □

11.4. Proof of Theorem 11.1. The proof proceeds by induction on $m$. For "$m = 1$" we notice that

(11.35) $\mathrm{DP}(k, \tau, \delta) \le \tau \cdot \mathrm{Reg}_\tau(k, \Lambda(k, 1, \delta/8), 1, \delta/2)$

for every $0 < \delta \le 1$ and every $k, \tau \in \mathbb{N}$ with $k \ge 2$ and $\tau \ge 1$. Indeed, let $M$ be a finite subset of $\mathbb{N}$ with $|M| \ge \tau \cdot \mathrm{Reg}_\tau(k, \Lambda(k, 1, \delta/8), 1, \delta/2)$ and $A$ be a subset of $[k]^{<\mathbb{N}}$ such that $\mathrm{dens}_{[k]^n}(A) \ge \delta$ for every $n \in M$. Let $(\Omega, \Sigma, \mu)$ be an arbitrary probability space and set $A_t = \Omega$ for every $t \in A$. By Lemma 11.10 applied for "$\rho = \delta$", "$\gamma = 1$" and "$B = A$", we see that (11.35) is satisfied.

Let m G N with m ^ 1 and assume that for every integer k 2, every 
< /3 ^ 1 and every finite sequence (cr n )^ir °f positive integers the number 
DP(fc, (a n )™-Q,f3) has been defined. Let k ^ 2, < S 1 and r , r m G N with 
To, r m ^ 1 be arbitrary. We set r' n — r n+ i for every n G {0, m — 1} and 

(11.36) 7Vo = DP(fc,(rX=o,0p/2). 

We claim that 

(H-37) DP(fc,(r„)™ ,<J) < ft^" ai) (iVo). 

Clearly this will finish the proof. To see that (|11.37l) is satisfied let L be a finite 
subset ofNwith \L\ > h { T [*f 2]) (N a ) and A be a subset of [k] <N with dens [fc]! (A) ^ 8 
for every I G L. Also fix a finite sequence (pn)™=o of variable words over k with 
|i?nl = r n l° r every n G {0, m}. By our assumptions on the cardinality of the set 
L and repeated applications of Proposition 111.111 for "r = To" and "p = po" , we 
see that there exist a word c over k, a variable word w over fc of pattern p$ and a 
subset i" of L with 

(11.38) |L"| > N ^ DP(fc, (O^.Gp^) 

such that the following properties are satisfied. 

(a) The set F = {c} U {c"w(a) : a G [fc]} is contained in A Moreover, setting 
V(l) = {c^w(a) : a G [fc]} and l\ the unique integer with V(l) C [k] , we 
have £i < min(L"). 

(b) For every £ G L" we have 

(11.39) dens^/-^ (A) ^ 9 P /2 

where A = {s G [fc] <N : t~s G A for every t G V(l)}. 

For every n G {0, m — 1} we set p' n = p n +i and we notice that the length of p' n 
is T* n . Therefore, by (111.38)) and (|11.39p and our inductive assumptions, there exist 
a word c' over k and a finite sequence {w' n )™=o of variable words over k of pattern 

(Pn)n=0 SUCn tnat tne Set 

(11.40) {c'} U {c~w' (ao)^ ...~w' n (a n ) '■ n G {0, m — 1} and ao, a n G [fc]} 

is contained in A. We set wq — w^d and w n = w' n _ 1 for every n G [m]. It is easily 
verified that the set 

(11.41) $\{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \{0, \dots, m\} \text{ and } a_0, \dots, a_n \in [k]\}$

is contained in $A$. This shows that (11.37) is satisfied and so the proof of Theorem 11.1 is completed.

11.5. Proofs of Theorem A and Theorem C. As we indicated in the introduction, Theorem A is a special case of Theorem C. Indeed, let $k \in \mathbb{N}$ with $k \ge 2$ and set $q_n = (v)$ for every $n \in \mathbb{N}$. Notice that a sequence $(w_n)$ of variable words over $k$ consists of left variable words if and only if it is of pattern $(q_n)$. Thus, Theorem A follows from Theorem C applied to the sequence $(q_n)$.

So, we only need to prove Theorem C. To this end, fix an integer $k \ge 2$ and a sequence $(p_n)$ of variable words over $k$. Let $0 < \delta \le 1$ and $A \subseteq [k]^{<\mathbb{N}}$ such that

(11.42) $\limsup_{n \to \infty} \frac{|A \cap [k]^n|}{k^n} > \delta$.

We fix an infinite subset $L$ of $\mathbb{N}$ such that $\mathrm{dens}_{[k]^\ell}(A) \ge \delta$ for every $\ell \in L$. Recursively, we define a sequence $(\delta_n)$ in $(0, 1]$ by the rule

(11.43) $\delta_0 = \delta$ and $\delta_{n+1} = 2^{-1}\Theta_P(k, \delta_n/2)$.

Using Corollary 11.12 we may select

(i) a sequence $(c_n)$ of words over $k$,

(ii) a sequence $(v_n)$ of variable words over $k$ of pattern $(p_n)$,

(iii) a sequence $(A_n)$ of subsets of $[k]^{<\mathbb{N}}$ with $A_0 = A$ and

(iv) two sequences $(L_n)$ and $(L'_n)$ of infinite subsets of $\mathbb{N}$

such that for every $n \in \mathbb{N}$ the following conditions are satisfied.

(C1) The set $L'_n$ is contained in $L_n$; moreover, $L_0 = L$.

(C2) The set $V_n = \{c_n\} \cup \{c_n^\frown v_n(a) : a \in [k]\}$ is contained in $\bigcup_{\ell \in L_n^0} A_n \cap [k]^\ell$ where $L_n^0 = \{\ell \in L_n : \ell < \min(L'_n)\}$.

(C3) Let $V_n(1) = \{c_n^\frown v_n(a) : a \in [k]\}$ and $\ell_n$ be the unique integer such that $V_n(1) \subseteq [k]^{\ell_n}$. Then

(11.44) $A_{n+1} = \{s \in [k]^{<\mathbb{N}} : t^\frown s \in A_n \text{ for every } t \in V_n(1)\}$

and

(11.45) $L_{n+1} = L'_n - \ell_n = \{\ell - \ell_n : \ell \in L'_n\}$.

(C4) For every $\ell \in L_n$ we have $\mathrm{dens}_{[k]^\ell}(A_n) \ge \delta_n$.

The recursive selection is fairly standard and the details are left to the reader.

We set $c = c_0$ and $w_n = v_n^\frown c_{n+1}$ for every $n \in \mathbb{N}$. By (ii) above, the sequence $(w_n)$ is of pattern $(p_n)$. Moreover, using conditions (C2) and (C3), it is easily verified that the set

(11.46) $\{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \mathbb{N} \text{ and } a_0, \dots, a_n \in [k]\}$

is contained in $A$. The proof of Theorem C is completed.

11.6. Further implications. In this subsection we will discuss the relation of Theorem B and Theorem A with the density Hales-Jewett Theorem and the density Halpern-Läuchli Theorem respectively. Notice, first, that the density Hales-Jewett Theorem follows from Theorem A via a standard compactness argument. In fact, we have the following finer quantitative information.

Proposition 11.13. For every integer $k \ge 2$ and every $0 < \delta \le 1$ we have

(11.47) $\mathrm{DHJ}(k, \delta) \le \mathrm{DCS}(k, 1, \delta)$.

Proof. Let $n \ge \mathrm{DCS}(k, 1, \delta)$ and fix a subset $A$ of $[k]^n$ with $|A| \ge \delta k^n$. For every $\ell \in [n]$ and every $y \in [k]^{n-\ell}$ let $A_y = \{x \in [k]^\ell : y^\frown x \in A\}$ and observe that

(11.48) $\mathbb{E}_{y \in [k]^{n-\ell}}\, \mathrm{dens}(A_y) = \mathrm{dens}(A) \ge \delta$.

Hence, for every $\ell \in [n]$ we may select $y_\ell \in [k]^{n-\ell}$ such that $\mathrm{dens}(A_{y_\ell}) \ge \delta$. We set

(11.49) $B = \bigcup_{\ell \in [n]} A_{y_\ell}$

and we notice that $\mathrm{dens}_{[k]^\ell}(B) \ge \delta$ for every $\ell \in [n]$. Since $n \ge \mathrm{DCS}(k, 1, \delta)$, there exists a Carlson-Simpson line $R$ of $[k]^{<\mathbb{N}}$ which is contained in $B$. Let $(c, w)$ be the generating sequence of $R$. Also let $\ell_R \in [n]$ be the unique integer such that the 1-level $R(1)$ of $R$ is contained in $[k]^{\ell_R}$. Then, setting

(11.50) $V = \{y_{\ell_R}{}^\frown c^\frown w(a) : a \in [k]\}$,

we see that $V$ is a combinatorial line of $[k]^n$ and $V \subseteq A$. This shows that (11.47) is satisfied, as desired. □
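
The averaging identity (11.48) and the selection of a good prefix can be verified by brute force on small instances. The Python sketch below is an illustration only; words are tuples over {1, ..., k} and the function names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def section(A, y, ell):
    """The section A_y: all x of length ell such that y followed by x is in A.
    Here A is a set of words of a common length n and y has length n - ell."""
    y = tuple(y)
    return {w[len(y):] for w in A if w[:len(y)] == y}


def average_section_density(A, k, n, ell):
    """Left-hand side of (11.48): the mean of dens(A_y) over all prefixes y."""
    prefixes = list(product(range(1, k + 1), repeat=n - ell))
    total = sum(Fraction(len(section(A, y, ell)), k ** ell) for y in prefixes)
    return total / len(prefixes)


def best_prefix(A, k, n, ell):
    """A prefix y of length n - ell maximizing dens(A_y); by averaging, its
    section has density at least dens(A)."""
    return max(product(range(1, k + 1), repeat=n - ell),
               key=lambda y: len(section(A, y, ell)))
```

For A = {(1,1,1), (1,2,2), (2,1,1)} inside $[2]^3$ the average equals 3/8 for every choice of the section length, in agreement with (11.48).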

We proceed to discuss how one can deduce the density Halpern-Läuchli Theorem from Theorem A. The argument is well-known (see, e.g., [6, 29]) but we will comment on it for the benefit of the reader.

Recall that a tree is a partially ordered set $(T, <)$ such that the set $\{s \in T : s < t\}$ is finite and linearly ordered under $<$ for every $t \in T$. A tree $T$ is said to be homogeneous if it is uniquely rooted and there exists an integer $b \ge 2$, called the branching number of $T$, such that every $t \in T$ has exactly $b$ immediate successors. A typical example of a homogeneous tree with branching number $b \ge 2$ is the set consisting of all finite sequences having values in a set $A$ of cardinality $b$ and equipped with the partial order of end-extension; it is denoted by $A^{<\mathbb{N}}$ and can, of course, be identified with $[b]^{<\mathbb{N}}$. Part of the interest in homogeneous trees of this form is based on the fact that they can be used to "code" the level product

(11.51) $\otimes(T_1, \dots, T_d) := \bigcup_{n \in \mathbb{N}} T_1(n) \times \dots \times T_d(n)$

of a finite sequence $(T_1, \dots, T_d)$ of homogeneous trees. Specifically, we have the following lemma.

Lemma 11.14. Let $d \in \mathbb{N}$ with $d \ge 1$. Also let $\mathbf{b} = (b_1, \dots, b_d) \in \mathbb{N}^d$ with $b_i \ge 2$ for every $i \in [d]$ and set $A_{\mathbf{b}} = [b_1] \times \dots \times [b_d]$. Finally let $(T_1, \dots, T_d)$ be a finite sequence of homogeneous trees such that the branching number of $T_i$ is $b_i$ for every $i \in [d]$. Then there exists a bijection

(11.52) $\Phi_{\mathbf{b}} : A_{\mathbf{b}}^{<\mathbb{N}} \to \otimes(T_1, \dots, T_d)$

with the following properties.

(i) For every $n \in \mathbb{N}$ we have $\Phi_{\mathbf{b}}(A_{\mathbf{b}}^n) = T_1(n) \times \dots \times T_d(n)$.

(ii) For every $c \in A_{\mathbf{b}}^{<\mathbb{N}}$ and every sequence $(w_n)$ of left variable words over $A_{\mathbf{b}}$ there exist strong subtrees $(S_1, \dots, S_d)$ of $(T_1, \dots, T_d)$ having common level set such that, setting

(11.53) $S = \{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \mathbb{N} \text{ and } a_0, \dots, a_n \in A_{\mathbf{b}}\}$,

we have $\Phi_{\mathbf{b}}(S) = \otimes(S_1, \dots, S_d)$.

Proof. Let $i \in [d]$ be arbitrary and $\pi_i : A_{\mathbf{b}} \to [b_i]$ be the natural projection. Clearly we may assume that the tree $T_i$ coincides with $[b_i]^{<\mathbb{N}}$ and so we may consider the "extension" $\bar{\pi}_i : A_{\mathbf{b}}^{<\mathbb{N}} \to T_i$ of $\pi_i$ defined by $\bar{\pi}_i(\emptyset) = \emptyset$ and

(11.54) $\bar{\pi}_i((a_0, \dots, a_{n-1})) = (\pi_i(a_0), \dots, \pi_i(a_{n-1}))$

for every integer $n \ge 1$ and every $(a_0, \dots, a_{n-1}) \in A_{\mathbf{b}}^n$. The map $\Phi_{\mathbf{b}}$ is then defined by the rule

(11.55) $\Phi_{\mathbf{b}}(s) = (\bar{\pi}_1(s), \dots, \bar{\pi}_d(s))$.

It is easily verified that $\Phi_{\mathbf{b}}$ is a bijection and satisfies all desired properties. □
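
The coding map of the proof is just a coordinatewise "unzipping" of a word over the product alphabet. The following Python sketch makes this explicit under the assumption, made in the proof, that each tree $T_i$ is $[b_i]^{<\mathbb{N}}$; letters of the product alphabet are modelled as d-tuples, and the function names are hypothetical.

```python
from itertools import product


def phi_b(s, d):
    """Send a word s over the product alphabet (a tuple of d-tuples) to the
    d-tuple of its coordinate words, all of the same length as s."""
    return tuple(tuple(a[i] for a in s) for i in range(d))


def phi_b_inverse(nodes):
    """Send a d-tuple of words of equal length back to a word of d-tuples."""
    return tuple(zip(*nodes))


def product_level(b, n):
    """All words of length n over the alphabet [b_1] x ... x [b_d]."""
    alphabet = list(product(*[range(1, bi + 1) for bi in b]))
    return list(product(alphabet, repeat=n))
```

For b = (2, 3) and n = 2 the 36 words of length 2 over the product alphabet are mapped injectively onto the 4 × 9 = 36 pairs of nodes at level 2, which is property (i) of the lemma in a toy case.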

With Lemma 11.14 at our disposal, let us see how Theorem A yields the density Halpern-Läuchli Theorem. To this end, fix a finite sequence $(T_1, \dots, T_d)$ of homogeneous trees and a subset $A$ of the level product of $(T_1, \dots, T_d)$ such that

(11.56) $\limsup_{n \to \infty} \frac{|A \cap (T_1(n) \times \dots \times T_d(n))|}{|T_1(n) \times \dots \times T_d(n)|} > 0$.

Let $\mathbf{b} = (b_1, \dots, b_d)$ where $b_i$ is the branching number of the tree $T_i$ for every $i \in [d]$ and consider the bijection $\Phi_{\mathbf{b}}$ obtained by Lemma 11.14. We set $B = \Phi_{\mathbf{b}}^{-1}(A)$. By part (i) of Lemma 11.14 and (11.56), we see that

(11.57) $\limsup_{n \to \infty} \frac{|B \cap A_{\mathbf{b}}^n|}{|A_{\mathbf{b}}^n|} > 0$.

Hence, by Theorem A, there exist $c \in A_{\mathbf{b}}^{<\mathbb{N}}$ and a sequence $(w_n)$ of left variable words over $A_{\mathbf{b}}$ such that the set

(11.58) $\{c\} \cup \{c^\frown w_0(a_0)^\frown \dots ^\frown w_n(a_n) : n \in \mathbb{N} \text{ and } a_0, \dots, a_n \in A_{\mathbf{b}}\}$

is contained in $B$. Invoking the definition of the set $B$ and part (ii) of Lemma 11.14 we conclude that there exist strong subtrees $(S_1, \dots, S_d)$ of $(T_1, \dots, T_d)$ having common level set such that the level product of $(S_1, \dots, S_d)$ is contained in $A$.